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COMPOUNDS AND METHODS FOR THERAPY 
AND DIAGNOSIS OF LUNG CANCER 

TECHNICAL FIELD 

The present invention relates generally to compositions and methods for the 
treatment and diagnosis of lung cancer. The invention is more specifically related to 
nucleotide sequences that are preferentially expressed in limg tumor tissue, together with 
polypeptides encoded by such nucleotide sequences. The inventive nucleotide sequences and 
polypeptides may be used in vaccines and pharmaceutical compositions for the treatment and 
diagnosis of lung cancer. 

BACKGROUND OF THE INVENTION 

Lung cancer is the primary cause of cancer death among both men and women 
in the U.S., with an estimated 172,000 new cases being reported in 1994. The five-year 
survival rate among all lung cancer patients, regardless of the stage of disease at diagnosis, is 
only 13%. This contrasts with a five-year survival rate of 46% among cases detected while 
the disease is still localized. However, only 16% of lung cancers are discovered before the 
disease has spread. 

Early detection is difficult since clinical symptoms are often not seen until the 
disease has reached an advanced stage. Currently, diagnosis is aided by the use of chest x- 
rays, analysis of the type of cells contained in sputum and fiberoptic examination of the 
bronchial passages. Treatment regimens are determined by the type and stage of the cancer, 
and include surgery, radiation therapy and/or chemotherapy. In spite of considerable research 
into therapies for the disease, lung cancer remains difficult to treat 

Accordingly, there remains a need in the art for improved vaccines, treatment 
methods and diagnostic techniques for lung cancer. 

SUMMARY OF THE INVENTION 

Briefly stated, the present invention provides compounds and methods for the 
therapy of lung cancer. In a first aspect, isolated polynucleotide molecules encoding lung 
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more such polypeptides, fusion proteins or polynucleotide molecules in combination with an 
immime response enhancer. 

In related aspects, the present invention provides methods for inhibiting the 
development of lung cancer in a patient, comprising administering to a patient an effective 
amount of at least one of the above pharmaceutical compositions and/or vaccines. 

\ Additionally, the present invention provides methods for immunodiagnosis of 
lung cancer, together with kits for use in such methods. Polypeptides are disclosed which 
comprise at least an inununogenic portion of a lung tumor protein or a variant of said protein 
that differs only in conservative substitutions and/or modifications, wherein the lung tumor 
protein comprises an amino acid sequence encoded by a polynucleotide molecule having a 
sequence selected from the group consisting of nucleotide sequences recited in SEQ ID NO: 
M09, 111, 113, 115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171, and variants 
thereof. Such polypeptides may be usefidly employed in the diagnosis and monitoring of 
limg cancer. 

In one specific aspect of the present invention, methods are provided for 
detecting lung cancer in a patient, comprising: (a) contacting a biological sample obtained 
from a patient with a binding agent that is capable of binding to one of the above 
polypeptides; and (b) detecting in the sample a protein or polypeptide that binds to the 
binding agent. In preferred embodiments, the binding agent is an antibody, most preferably a 
monoclonal antibody. 

In related aspects, methods are provided for monitoring the progression of 
lung cancer in a patient, comprising: (a) contacting a biological sample obtained from a 
patient with a bindmg agent that is capable of binding to one of the above polypeptides; (b) 
determining in the sample an amount of a protein or polypeptide that binds to the binding 
agent; (c) repeating steps (a) and (b); and comparing the amounts of polypeptide detected in 
steps (b) and (c). 

Within related aspects, the present invention provides antibodies, preferably 
monoclonal antibodies, that bind to the inventive polypeptides, as well as diagnostic kits 
comprising such antibodies, and methods ofi using such antibodies to inhibit the development 
of lung cancer. 
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present invention are also provided. These and other aspects of the present invention will 
become apparent upon reference to the following detailed description. All references 
disclosed herein are hereby incorporated by reference in their entirety as if each was 
incorporated individually. 

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to compositions 
and methods for the therapy and diagnosis of lung cancer. The compositions described herein 
include polypeptides, fusion proteins and polynucleotide molecules. Also included within 
the present invention are molecules (such as an antibody or fragment thereof) that bind to the 
inventive polypeptides. Such molecules are referred to herein as "binding agents." 

In one aspect, the subject invention discloses polypeptides comprising an 
immunogenic portion of a human lung tumor protein, >?sdierein the lung tumor protein 
includes an amino acid sequence encoded by a polynucleotide molecule including a sequence 
selected from the group consisting of (a) nucleotide sequences recited in SEQ ID NO: 1-109, , 
111, 113 115-151, 153, 154,157, 158, 160, 162-164, 167, 168 and 171, (b) the complements 
of said nucleotide sequences, and (c) variants of such sequences. As used herein, the term 
"polypeptide" encompasses amino acid chains of any length, including full length proteins, 
wherein the anuno acid residues are linked by covalent peptide bonds. Thus, a polypeptide 
comprising a portion of one of the above Ixmg tumor proteins may consist entirely of the 
portion, or the portion may be present within a larger polypeptide that contains additional 
sequences. The additional sequences may be derived from the native protein or may be 
heterologous, and such sequences may (but need not) be inmnmoreactive and/or antigenic. 
As detailed below, such polypeptides may be isolated from Ixmg tumor tissue or prepared by 
synthetic or recombinant means. 

As used herein, an "iimnxmogenic portion" of a limg tumor protein is a portion 
that is capable of eliciting an inmnme response in a patient inflicted with lung cancer £md as 
such binds to antibodies present within sera from a lung cancer patient. Such immunogenic 
portions generally comprise at least about 5 amino acid residues, more preferably at least 
about 10, and most preferably at least about 20 amino acid residues. Inununogenic portions 
of the proteins described herein may be identified in antibody binding assays. Such assays 
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exhibit at least about 70%, more preferably at least about 90% and most preferably at least 
about 95% identity (determined as describe below) to the identified polypeptides. 

As used herein, a "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one skilled in the art 
of peptide chemistry would expect the secondary structure and hydropathic nature of the 
polyp^tide to be substantiaUy unchanged. In general, the following groups of amino acids 
represent conservative changes: (1) ala, pro, gly, glu, asp, ghi, asn, ser, thr; (2) cys, ser, tyr, 
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, tip, his. 

Variants may also, or alternatively, contain other modifications, including the 
deletion or addition of amino acids that have minimal mfluence on the antigenic properties, 
secondary structure and hydropathic nature of the polypeptide. For example, a polypeptide 
may be conjugated to a signal (or leader) sequence at the N-terminal end of the protein which 
co-translationally or post-translationally directs transfer of the protein. The polypeptide may 
also be conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a 
solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc 
region. 

A nucleotide ^Variant" is a sequence that differs fix)m the recited nucleotide 
sequence m having one or more nucleotide deletions, substitutions or additions. Such 
modifications may be readily introduced using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al. 
(DNA, 2:183, 1983). Nucleotide variants may be naturally occurring alleUc variants, or non- 
naturally occurring variants. Variant nucleotide sequences preferably exhibit at least about 
70%, more preferably at least about 80% and most preferably at least about 90% identity 
(determined as described below) to the recited sequence. 

The antigens provided by the present invention include variants that are 
encoded by polynucleotide sequences which are substantially homologous to one or more of 
the polynucleotide sequences specifically recited herein. "Substantial homology," as used 
herein, refers to polynucleotide sequences that are capable of hybridizing under moderately 
stringent conditions. Suitable moderately stringent conditions include prewashing in a 
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Preferably, the "percentage of sequence identity" is determined by comparing 
two optimally aligned sequences over a window of comparison of at least 20 positions, 
wherein the portion of the polynucleotide sequence in the comparison window may comprise 
additions or deletions (i.e. gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 
percent, as compared to the reference sequences (which does not comprise additions or 
deletions) for optimal alignment of the two sequences. The percentage is calculated by 
determining the number of positions at which the identical nucleic acid bases or amino adid 
residue occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the reference sequence (i.e. 
the window size) and multiplying the results by 100 to yield the percentage of sequence 
identity. 

Also included in the scope of the present invention are alleles of the genes 
encoding the nucleotide sequences recited in herein. As used herein, an "allele" or "allellic 
sequence" is an alternative form of the gene which may result from at least one mutation in 
the nucleic acid sequence. Alleles may result in altered mRNAs or polypeptides whose 
structure or function may or may not be altered. Any given gene may have none, one, or 
many allelic forms. Common mutational changes which give rise to alleles are generally 
ascribed to natural deletions, additions, or substitutions of nucleotides. Each of these types of 
changes may occur alone or in combination with the others, one or more times in a given 
sequence. 

For lung tumor polypeptides with immunoreactive properties, variants may, 
alternatively, be identified by modifying the amino acid sequence of one of the above 
polypeptides, and evaluating the immunoreactivity of the modified polypeptide. For lung 
tumor polypeptides useful for the generation of diagnostic binding agents, a variant may be 
identified by evaluating a modified polypeptide for the ability to generate antibodies that 
detect the presence or absence of lung cancer. Such modified sequences may be prepared and 
tested using, for example, the representative procedures described herein. 

The lung tumor polypeptides of the present invention, and polynucleotide 
molecules encoding such polypeptides, may be isolated from lung tumor tissue using any of a 
variety of methods well known in the art. Polynucleotide sequences corresponding to a gene 
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cDNA molecule can be generated by ligating suitable fragments, using well known 
techniques. 

Alternatively, there are numerous amplification techniques for obtaining a full 
length coding sequence from a partial cDNA sequence. Within such techniques, 
amplification is generally performed via PCR. Any of a variety of commercially available 
kits may be used to perform the amplification step. Primers may be designed using 
techniques well known in the art {see, for example, Mullis et al.. Cold Spring Harbor Symp. 
Quant. BioL 57:263, 1987; Erlich ed., PCR Technology, Stockton Press, NY, 1989), and 
software well known in the art may also be employed. Primers are preferably 22-30 
nucleotides in length, have a GC content of at least 50% and anneal to the target sequence at 
temperatures of about 68°C to 72°C. The amplified region may be sequenced as described 
above, and overlapping sequences assembled into a contiguous sequence. 

One such amplification technique is inverse PCR {see Triglia et al., Nucl 
Acids Res, /5;8186, 1988), which uses restriction enzymes to generate a firagment in the 
known region of the gene. The fragment is then cfrcularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. Within an 
alternative approach, sequences adjacent to a partial sequence may be retrieved by 
amplification with a primer to a linker sequence and a primer specific to a known region. The 
amplified sequences are typically subjected to a second round of amplification with the same 
linker primer and a second primer specific to the known region. A variation on this 
procedure, which employs two primers that initiate extension in opposite directions ftom the 
known sequence, is described in WO 96/38591. Additional techniques include capture PCR 
(Lagerstrom et al., PCR Methods Applic, 7:111-19, 1991) and walking PCR (Parker et al., 
NucL Acids, Res, 7P:3055-6O, 1991). Transcription-Mediated Amplification, or TMA is 
another method that may be utilized for the amplification of DNA, rRNA, or mRNA, as 
described in Patent No. PCT/US91/03184. This autocatalytic and isothermic non-PCR based 
method utilizes two primers and two enzymes: RNA polymerase and reverse transcriptase. 
One primer contains a promoter sequence for RNA polymerase. In the first amplification, the 
promoter-primer hybridizes to the target rRNA at a defined site. Reverse transcriptase creates 
a DNA copy of the target rRNA by extension fix)m the 3'end of the promoter-primer. The 
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acick, and generally fewer than about 50 amino acids, may be generated using techniques 
well known to those of ordinary skill in the art. For example, such polypeptides may be 
synthesized using any of the commercially available soHd-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain (see, for example, Merrifield, J. Am. Chem, Soc, <9i:2 149-2 146, 
1963). Equipment for automated synthesis of polypeptides is commercially available fi-om 
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and may be 
operated according to the manufacturer's instructions. 

In addition, lung tumor antigens may be identified by T cell expression 
cloning. One source of tumor specific T cells is from surgically excised tumors from human 
patients. In one method for isolating and characterizing tumor specific T cells, the excised 
tumor is minced and enzymatically digested for several hours to release tumor cells and 
infiltrating lymphocytes (tumor infiltrating T cells, or TILs). The cells are washed in HBSS 
buffer and passed over a FicoU (100%/75%/HBSS) discontinuous gradient to separate tumor 
cells and lymphocytes from non-viable cells. Two bands are harvested from the interfaces; 
the upper band at the 75%/HBSS interface contains predominantly tumor cells, while the 
lower band at the 100%/75%/HBSS interface contains a majority of lymphocytes. The TILs 
are expanded in culture by techniques well known in the art, but preferably in culture media 
supplemented with 10 ng/ml IL-7 and 100 U/ml IL-2, or alternatively, cultured and expanded 
in tissue culture plates that have been pre-adsorbed With anti-CD3 monoclonal antibody 
(0KT3). The resulting TIL cultures are analyzed by FACS to confirm that the vast majority 
are CD8+ T cells (>90% of gated population). 

In addition, the tumor cells are also expanded in culture using standard 
techniques well known in the art to establish a tumor cell line, which is later confirmed to be 
lung carcinoma cells by inununohistochemical analysis. The tumor cell line is transduced 
with a retroviral vector to express human CD80. The tumor cell line is fiirther characterized 
by FACS analysis to confirm the strong expression levels of CD80, class I and II MHC 
molecules. 

The specificity of the TIL lines to lung tumor is confirmed by INF-y and/or 
TNF-a cytokine release assays. For example, TIL cells from day 21 cultures are co-cultured 
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are incorporated into pharmaceutical compositions or vaccines for use in one or more of the 
methods disclosed herein. 

In a related aspect, the present invention provides fusion proteins comprising a 
first and a second inventive polypeptide or, alternatively, a polypeptide of the present 
invention and a known lung tumor antigen, together with variants of such fusion proteins. 
The fusion proteins of the present invention may (but need not) include a linker peptide 
between the first and second polypeptides. 

A polynucleotide sequence encoding a fusion protein of the present invention 
is constructed using known recombinant DNA techniques to assemble separate 
polynucleotide sequences encoding the first and second polypeptides mto an appropriate 
expression vector. The 3' end of a DNA sequence encoding the first polypeptide is ligated, 
with or without a peptide linker, to the 5' end of a DNA sequence encoding the second 
polypeptide so that the reading fi^es of the sequences are in phase to permit mRNA 
translation of the two DNA sequences into a smgle fusion protem that retains the biological 
activity of both the first and the second polypeptides. 

A peptide linker sequence may be employed to separate the first and the 
second polypeptides by a distance sufficient to ensure that each polypeptide folds into its 
secondary and tertiary structures. Such a peptide linker sequence is incorporated into the 
fusion protem using standard techniques well known in the art. Suitable peptide linker 
sequences may be chosen based on the following factors: (1) their ability to adopt a flexible 
extended conformation; (2) their inability to adopt a secondary structure that could interact 
with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic 
or charged residues that might react with the polypeptide functional epitopes. Preferred 
peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which 
may be usefiilly employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 
1985; Murphy etal., Proc, Natl Acad Sci. USA 55:8258-8262, 1986; U.S. Patent 
No. 4,935,233 and U.S. Patent No. 4,751,180. The linker sequence may be fi-om 1 to about 
50 amino acids in length. Peptide sequences are not required v^en the first and second 



BNSDOCID: <WO 9947B74A2 I > 



spnpin sj30irei(U3 asuodsai sunuiun-ogpsds-uou jo saidurexg *TO3inre snouaSoxa ub 
ssuodsaj aunuiun ire SuiooBqua jo Satiion^ ^iq^dBO si jsouBqas asuodsaj sunuraii ogiosds 
-uou 9ip uiaiaqAA 'jsouBqua osuodsai-aumiiun ogioads-uou b pro ssppdadXiod qons sioui 
JO 3U0 Qsuduioo Xbui S3U10DBA 3qx ojqBidoooB Xi|BOT3oioxs>(qd B pire *(}oaJ9tp sihbuba 

jo) s30U9nb3s aAoqB sip jo ajom Jo auo areiuoo Xbui qotqM jo qoBs ^sappdadyCjod moui 
JO OTO osuduioo Xbiu suopisoduioo [BopnaoBULrcq J outooba b jo uoTjisodtuoo [BopraDBuuBqd 
B mqjiM juasaid XhbjsusS st 3pi;dadXiod dApudAui aqj ^s^osdsB assqj 

•s3mp opnsdBjaqiomsqo [BUopuaAuoo puB XdBJsqjoipBJ jo uoijBJjsniimpB Xq 
;u3imB3Ji JO/puB sjounu XjBumd jo psAOUiai iBOiSjns 3uimo[joj jo o; joud jsqira paiajsnnuipB 
XiqBJ9jajd 3JB spunoduioo aqi -jsoubo 3xmi jo ^TOuidopAsp aq; iiqiqui o; jo j3oubo 
3an| iBOQ 01 pasn 9q Xbui uxaiaq pssoiosip spunoduioo sqi 'Xi3uipjooov ssBOSip aiqBjDsjap jo 
aag aq Xbui jo '3SB9Sip qjiM papiQgB oq Xbui luoijBd y UBumq b XiqBioj^jd *iBunuB popoo[q 
-uuBM XuB 01 sjapj „judpBd„ b *uTOJoq pssn sy luaijBd b m jsoubo 3uni jo XdBJsqjoununut 
joj (supiojd uoisry jo ssinoaioui sppoapnuXiod *s3ppdodXiod aq Xbui qoiqM) 
uiaisq psquosap spunoduioo 3qj jo aioui jo ouo 3uisn joj spoq^sui sapiAOJd snqi uoiiuqaui 
ju9S3jd sqx sipo jouinj 3uni o; osuodsai aunuiun umo s,iu3pBd sqj sajBinuii^ sppdadXjod 
3qj uiaiaqM *j3oubo 3uni jo XdBJsqj joj pasn 9q XipisusS Xbui uisjojd jouim Sunj b jo 
uoiyod oiuaSounuiuii ub asudoioo jBqj uopuoAUi ^uasojd aqj jo ssppdodXioj 

((L661) 16 

-9S''9Sf "'Pm T l3u3 Maj^ -jb ja omo;s *3lduiBX9 joj *93s) suiajojd sppBdaq puB sisoinojaqm 
'snuBjoj apnpui supjojd qons jo sajdurexg ssuodssj iposj b Suppip jo ajqBdBO si upjojd 
oiusSounuiuii 9qi X[qBJOjojj upiojd orasSounuiuix pajBpjun ub qjiAv jaqjsSoi uopusAUi 
luasojd aqj jo 3ppd9dX{od b osudmoo ^Bq; papiAOJd osib 9jb supjojd uoisnj 

•9ppd3dX[od puooQS sqi Suipoouo aouonbas VNQ ,£ irasojd Xpio 3jb sjbuSis 
uopBUTUusi uopduosuBJj puB uopBjsuBjj pu9 o; sjinfa^j suopoo do^ 'XiJBiiuns -sapKidadXiod 
;sjij sqi 3uipoou9 souanbos vNQ ^T* oi ,$ X[uo pojBooi oib oppoopnuXjod jo uoissajdxs 
JOJ 9iqisuods3J sjusuiop XjojBfnSoj aqx s^uouiop XjoiBpSsj puopBisuBJi jo puouduosuBJj 
aiqBjms oj p3>iuTi XfqBjado 3jb saousnbos oppoapnuXjod p3jB3ii aqx 

'30U9J0JJ0;U1 OUOJS lU3A9Jd puB SUIBUIOp jBuouounj 9qi 

siBJBdos oj pssn 9q ubo jBqj suoiSaj ppB ouiuib p3Uiuu9i-{v[ jBpussso-uou 9ABq sappdsdXpd 

91 



86^S0/66Sa/lD<i 



PL9LPf66 OM 



wo 99/47674 



PCT/US99/05798 



17 

adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which 
the polypeptide is incorporated). Pharmaceutical compositions and vaccines may also contain 
other epitopes of lung tumor antigens, eitiier incorporated into a fusion protein as described 
above (i.e., a single polypeptide that contains multiple epitopes) or present within a separate 
polypeptide. 

■\ Alternatively, a pharmaceutical composition or vaccine may contain 
poljmucleotide encoding one or more of the above polypeptides and/or fusion proteins, such 
that the polypeptide is generated in situ. In such pharmaceutical compositions and vaccines, 
the polynucleotide may be present within any of a variety of delivery systems known to those 
of ordinary skill in the art, including nucleic acid expression systems, bacteria and viral 
expression systems. Appropriate nucleic acid expression systems contain the necessary 
polynucleotide sequences for expression in the patient (such as a suitable promoter). 
Bacterial delivery systems involve the administration of a bacterium (such as Bacillus- 
Calmette-Guerrin) that expresses an epitope of a lung cell antigen on its cell surface. In a 
preferred embodiment, the polynucleotides may be introduced using a viral expression system 
(e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use of a 
non-pathogenic (defective), replication competent virus. Suitable systems are disclosed, for 
example, in Fisher-Hoch et al., PNAS 55:317-321, 1989; Flexner et al., Ann KY. Acad. Scl 
559:86-103, 1989; Flexner et al.. Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 
4,769,330, and 5,017,487; WO 89/01973; U.S. Patent No, 4,777,127; GB 2,200,651; 
EP 0,345,242; WO 91/02805; Berkner, Biotechniques 5:616-627, 1988; Rosenfeld et al.. 
Science 252:431-434, 1991; KoUs et al., PNAS 97:215-219, 1994; Kass-Eisler et al., PNAS 
90:11498-11502, 1993; Guzman et al.. Circulation 55:2838-2848, 1993; and Guzman et al., 
C/r. Res. 75:1202-1207, 1993. Techniques for incorporating polynucleotide into such 
expression systems are well known to those of ordinary skill in the art. The polynucleotides 
may also be "naked," as described, for example, in published PCT application WO 90/1 1092, 
and Uhner et al.. Science 259:1745-1749, 1993, reviewed by Cohen, Science 259:1691-1692, 
1993. The uptake of naked polynucleotides may be increased by coating the polynucleotides 
onto biodegradable beads, which are efBcientiy transported into the cells. 
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available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco 
Laboratories, Detroit, MI) and Merck Adjuvant 65 (Merck and Company, Inc., Rahway, 
NJ).Polypeptides and polynucleotides disclosed herein may also be employed in adoptive 
immunotherapy for the treatment of cancer. Adoptive immimotherapy may be broadly 
classified into either active or passive immimotherapy. In active immimotherapy, treatment 
relies dn the in vivo stimulation of the endogenous host immune system to react against 
tumors with the administration of immune response-modifying agents (for example, tumor 
vaccines, bacterial adjuvants, and/or cytokines). 

In passive immunotherapy, treatment involves the delivery of biologic 
reagents with established tumor-immune reactivity (such as effector cells or antibodies) that 
can directly or indirectly mediate antitumor effects and does not necessarily depend on an 
intact host immune system. Examples of effector cells include T lymphocytes (for example, 
CD8+ cytotoxic T-lymphocyte, CD4+ T-helper, gamma/delta T lymphocytes, tumor- 
mfiltrating lymphocytes), killer cells (such as Natural Killer cells, lymphokine-activated killer 
cells), B cells, or antigen presenting cells (such as dendritic cells and macrophages) 
expressing the disclosed antigens. The polypeptides disclosed herein may also be used to 
generate antibodies or anti-idiotypic antibodies (as in U.S. Patent No. 4,918,164), for passive 
immunotherapy. 

The predominant method of procuring adequate numbers of T-cells for 
adoptive immunotherapy is to grow immune T-cells in vitro. Culture conditions for 
expanding single antigen-specific T-cells to several billion in number with retention of 
antigen recognition in vivo are well known in the art. These in vitro culture conditions 
typically utilize intermittent stimulation with antigen, often in the presence of cytokines, such 
as IL-2, and non-dividing feeder cells. As noted above, the immunoreactive polypeptides 
described herein may be used to r^idly expand antigen-specific T cell cultures in order to 
generate sufficient number of cells for immunotherapy. In particular, antigen-presenting 
cells, such as dendritic, macrophage, monocyte, fibroblast, or B-cells, may be pulsed with 
immunoreactive polypeptides, or polynucleotide sequence(s) may be introduced into antigen 
presenting cells, using a variety of standard techniques well known in the art. For example, 
antigen presenting cells may be transfected or transduced with a polynucleotide sequence, 
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Standard techniques and the cells are administered back to the patient. 

In other embodiments, T-cell and/or antibody receptors specific for the 
polypeptides disclosed herein can be cloned, expanded, and transferred into other vectors or 
effector cells for use in adoptive immunotherapy. In particular, T cells may be transfected 
with the appropriate genes to express the variable domains fit)m tumor specific monoclonal 
antibodies as the extracellular recognition elements and joined to the T cell receptor signaling 
chains, resulting in T cell activation, specific lysis, and cytokme release. This enables the T 
cell to redirect its specificity in an MHC-independent manner. See for example, Eshhar, Z., 
Cancer Immunol Immunother, 45(3-4):131-6, 1997 and Hwu, P., et al. Cancer Res, 
55(15):3369-73, 1995. Another embodiment may include the transfection of tumor antigen 
specific alpha and beta T cell receptor chains into alternate T cells, as in Cole, DJ, et al. 
Cancer Res, 55(4):748-52, 1995. 

In a fiirther embodiment, syngeneic or autologous dendritic cells may be 
pulsed with peptides corresponding to at least an immunogenic portion of a polypeptide 
disclosed herein. The resulting antigen-specific dendritic cells may either be transferred into 
a patient, or employed to stimulate T cells to provide antigen-specific T cells which may, in 
turn, be administered to a patient. The use of peptide-pulsed dendritic cells to generate 
antigen-specific T cells and the subsequent use of such antigen-specific T cells to eradicate 
tumors in a murine model has been demonstrated by Cheever et al. Immunological Reviews, 
757:177, 1997). 

Furthennore, vectors expressing the disclosed polynucleotides may be 
introduced into stem cells taken fix^m the patient and clonally propagated in vitro for 
autologous transplant back into the same patient 

Additionally, vectors expressing the disclosed polynucleotides may be 
introduced into stem cells taken fix)m the patient and clonally propagated in vitro for 
autologous transplant back into the same patient. Polypeptides and fiision proteins of the 
present invention may also, or alternatively, be used to generate binding agents, such as 
antibodies or fiagments thereof, that are capable of detecting metastatic human lung tumors. 
Binding agents of the present invention may| generally be prepared using methods known to 
those of ordinary skill in the art, including the representative procedures described herein. 
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polypeptides, relative to a predetermined cut-off value. As used herein, suitable "biological 
samples" include blood, sera, urine and/or lung secretions. 

The level of one or more of the above polypeptides may be evaluated xising 
any binding agent specific for the polypeptide(s). A "binding agent," in the context of this 
invention, is any agent (such as a compound or a cell) that binds to a polypeptide as described 
above, i^s used herein, "binding" refers to a noncovalent association between two separate 
molecules (each of which may be free (/.e., in solution) or present on the surface of a cell or a 
solid support), such that a "complex" is formed. Such a complex may be free or immobilized 
(either covalently or noncovalently) on a support material. The ability to bind may generally 
be evaluated by determining a binding constant for the formation of the complex. The 
binding constant is the value obtained when the concentration of the complex is divided by 
the product of the component concentrations. In general, two compounds are said to "bind" 
in the context of the present invention when the binding constant for complex formation 
exceeds about 10^ L/mol. The bmding constant may be determined using methods well 
known to those of ordinary skill in the art. 

Any agent that satisfies the above requirements may be a binding agent. For 
example, a binding agent may be a ribosome with or without a peptide component, an RNA 
molecule or a peptide. In a preferred embodiment, the binding partner is an antibody, or a 
fragment thereof Such antibodies may be polyclonal, or monoclonal. In addition, the 
antibodies may be single cham, chimeric, CDR-grafted or humanized. Antibodies may be 
prepared by the methods described herein and by other mediods well known to those of skill 
in the art. 

There are a variety of assay formats known to those of ordinary skiU in the art 
for using a binding partner to detect polypeptide markers in a sample. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In a 
preferred embodiment, the assay involves the use of binding partner immobilized on a solid 
support to bind to and remove the polypeptide torn the remainder of the sample. The bound 
polypeptide may then be detected using a second binding partner that contains a reporter 
group. Suitable second binding partners include antibodies that bind to the binding 
partner/polypeptide complex. Alternatively, a competitive assay may be utilized, in which a 



BNSDOCIO: <WO_9947574A3„L> 



'iUV'ZW *1661 *:>looqpireH pire Sopi^o iCSoiouqD^^ounuraq 
9019} J '-^ a 'aas-) jaiqiEd Suipuiq aqj uo raSojpXq sApoB ub pire sunuB tre qjiM 
uoddns 3ip uo dnoiS apXqopps ub jo uopBSuapuoo Xq jo OToumbozusq 3insn 3upBOo j^uiXiod 
a^BudoiddB ub 3uiABq syoddns oj paqoBWB iCpuspAOO sq Xbui juoSb Smpmq aqj ^oidurexa 
JO J ';u33b 3uipuiq oqj uo *dnai3 ouiure jo |XxojpXq b sb qons *dnoj3 iBuopoury b puB poddns 
aq; qjoq qjiM ;oBai {{iM ^Bq; jusSboj {Buopom^iq b q^iM voddns aqi 3upoB9J jsjg Xq psAaiqoB 
sq XijBJ3U33 iCBUi yoddns px|os b oi ^usSb Surpuiq jo jasraqoB^B ja9[BA03 

1U33b 3uipuiq jo junoure ajBnbspB ub szinqouioii oi ;u3ioijjns si '3Ti \ 
vxoqis 01 Su 001 inoqB XiqBJSjaid puB *3t1 qi jnoqB oj 3u qx ;noqB moj; 3ui3ubj iu33b 3mpuiq 
JO junouiB uB xpiM (apuoiqoiXuiAi(|od jo susjXjSiCiod sb qons) ojBfd js^iiojotui ousBjd b jo \p/A 
B 3upoBjuoo 'IBJ3U93 ui ^Bp | ^TioqB puB juoq I jnoqB uasMjaq vCi|Boidy(j si mq 'ojruBJsduioi 
\^iA\ sauBA 3Uip pBjuoo oqx ouip JO lunouiB oiqB^ms b joj ^oddns pijos oq^ qjiM 
'jsjgnq siqBims b ui *juo3b 3uipuiq oq^ SupoBjuoo yCq poAoiqoB oq Xbui uopdjospB *sosbo qons 
UI -pajjajsjd si ouBjquioui b oi jo ajBjd joipojoixu b ui ijoav b o^ uoijdjospB Aq uoiiBZijiqouiuq 
•(iu33b 3up[uii-ssojo B JO Xbm Xq 33b>[uii b oq Xbiu jo jjoddns sqj uo sdnoj3 [Buoipunj 
puB u93puB gqi uosAuaq aSBjpiq pojip b aq Xbcu qoiqM) juauiqoBjiB jusjbaoo puB ^uopdJospB 
SB qons 'uopBioossB juoibaoouou qioq oi sjojaj „uoijBzqiqoiuuii„ uuai oq; 'uoi^uoaui ^uosajd 
oqj JO JX91UO0 oqi uj -ajmBJain ogpuoios puB ^uojBd sqi ui paquossp XiduiB ojb qoiqyw 'yB oqj 
in^s JO osoqi o; UAvomj sonbiuqooj jo X^ouba b 3uisn yoddns pijos aqi uo psziiiqouiun oq 
Xbui ju93b 3uipuiq oqi •189'6S£*S 'ON ^^^^M 'S'fl «f 'siduiBxs joj ^posopsip osoqi sb qons 
U0SU9S opdo jsqij b jo oppjBd oi;ou3bui b oq osib Xboi ^oddns sqx 'opuoiqoiXuiAXpd jo 
ouojXjsX[od SB qons [buojbui oijSBid b jo xojbj ^ssBjSjoqg *ssbi3 sb qons *osip jo pBoq b oq Xbui 
yoddns oq; 'XjOAiiBUJojiv 'auBjquioui oiqBjins joipo Jo osopiipoojjiu b jo 3;B[d js^iicijoiui b 
ui [|0M isoj B oq Xbui poddns pqos aqj *o|duiBXo joj -poqoBWB oq Xbui uo3puB oqi qifqAV o; ^B 
IIPIS XiBUipjo JO osoip oi wAoxn[ fBuojBUi XuB oq Xbui yoddns pqos oqi 
"JOuyBd 3uipuiq pozijiqoumii oqj qipw ojduiBS oq^ jo Xjiaijoboj oqi jo OAi^Boipui si JOUjJBd 
3uipuiq oqi oj oppdodXiod popqBi oqj jo 3utpuiq oqj ;iqtqui sidures oqi jo sjuouoduioo 
qoiqAv OJ lu^ixd oqx 'oiduiBS oip qjiM JoujJBd 3uipuiq oq; jo uopBqnom jo^b J^u^Bd 
3utpuiq pozi|iqoaiun oqi oj puiq oj poMO[[B puB dnoj3 Jo^odaj b qiiM popqBj si oppdodXfod 



S6LS0/66SnJlDd 



PL9LPm OM 



wo 99/47674 



PCT/US99/05798 



25 

In certain embodiments, the assay is a two-antibody sandwich assay. This 
assay may be performed by first contacting an antibody that has been immobilized on a solid 
support, commonly the well of a microtiter plate, with the sample, such that polypeptides 
within the sample are allowed to bind to the immobilized antibody. Unbound sample is then 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains bound to the solid support is then 
determined using a method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically blocked. 
Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum 
albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The immobilized antibody is 
then incubated with the sample, and polypeptide is allowed to bind to the antibody. The 
sample may be diluted with a suitable diluent, such as phosphate-buffered saline (PBS) prior 
to incubation. In general, an appropriate contact time {i.e., incubation time) is that period of 
time that is sufficient to detect the presence of polypeptide within a sample obtained from an 
individual with lung cancer. Preferably, the contact time is sufiBcient to achieve a level of 
binding that is at least about 95% of that achieved at equilibrium between bound and unbound 
polypeptide. Those of ordinary skill in the art will recognize that the time necessary to 
achieve equilibrium may be readily determined by assaying the level of binding that occurs 
over a period of time. At room temperature, an incubation time of about 30 minutes is 
generally sufficient. 

Unbound sample may then be removed by washing the solid support with an 
appropriate buffer, such as PBS containing 0.1% Tween 20™. The second antibody, which 
contains a reporter group, may then be added to the solid sxq)port. Preferred reporter groups 
include enzymes (such as horseradish peroxidase), substrates, cofactors, inhibitors, dyes, 
radionuclides, Ixraiinescent groups, fluorescent groups and biotin. The conjugation of 
antibody to reporter group may be achieved using standard methods known to those of 
ordinary skill in the art. 
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In a related embodiment, the assay is performed in a flow-through or strip test 
format, wherein the antibody is immobilized on a membrane, such as nitrocellulose. In the 
flow-through test, polypeptides within the sample bind to the immobilized antibody as the 
sample passes through the membrane. A second, labeled antibody then binds to the antibody- 
polypeptide complex as a solution containing the second antibody flows through the 
membraiVp. The detection of bound second antibody may then be performed as described 
above. In the strip test format, one end of the membrane to which antibody is bound is 
immersed in a solution containing the sample. The sample migrates along the membrane 
through a region containing second antibody and to the area of immobilized antibody. 
Concentration of second antibody at the area of immobilized antibody indicates the presence 
of lung cancer. Typically, the concentration of second antibody at that site generates a 
pattern, such as a line, that can be read visually. The absence of such a pattern indicates a 
negative result In general, the amount of antibody immobilized on the membrane is selected 
to generate a visually discernible pattern when the biological sample contains a level of 
polypeptide that would be sufficient to generate a positive signal in the two-antibody 
sandwich assay, in the format discussed above. Preferably, the amount of antibody 
immobilized on the membrane ranges from about 25 ng to about 1 |ig, and more preferably 
from about 50 ng to about 500 ng. Such tests can typically be performed with a very small 
amount of biological sample. 

Of course, numerous other assay protocols exist that are suitable for use with 
the antigens or antibodies of the present invention. The above descriptions are intended to be 
exemplary only. 

In another embodiment, the above polypeptides may be used as markers for 
the progression of lung cancer. In this embodiment, assays as described above for the 
diagnosis of lung cancer may be performed over time, and the change in the level of reactive 
polypeptide(s) evaluated. For example, the assays may be performed every 24-72 hours for a 
period of 6 months to 1 year, and thereafter performed as needed. In general, lung cancer is 
progressing in those patients in whom the level of polypeptide detected by the binding agent 
increases over time. In contrast, lung cancer is not progressing when the level of reactive 
pol>T>eptide either remains constant or decreases with time. 
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such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate 
host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or 
the blood. Contaminants may be removed from the antibodies by conventional techniques, 
such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this 
invention may be used in the purification process in, for example, an affinity chromatography 
step. \ 

Monoclonal antibodies of the present invention may also be used as 
therapeutic reagents, to diminish or eliminate lung tumors. The antibodies may be used on 
their own (for instance, to inhibit metastases) or coupled to one or more therapeutic agents. 
Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, 
and derivatives thereof Preferred radionuclides include '^Re, '^^Re, ^"At, 

and ^'^Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins 
include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin. Shigella 
toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable 
monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction 
between an agent and an antibody is possible when each possesses a substituent capable of 
reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl 
group, on one may be capable of reacting with a carbonyl-containing group, such as an 
anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a 
halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an antibody 
fipom an agent in order to avoid interference with binding capabilities. A linker group can 
also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and 
thus increase the coupling efBciency. An increase in chemical reactivity may also facilitate 
the use of agents, or functional groups on agents, which otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional or 
polyfunctional reagents, both homo- and hetero-functional (such as those described in the 
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nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, 
radionuclide. For example, U.S. Patent No. 4,673,562, to Davison etal. discloses 
representative chelating compounds and their synthesis. 

A variety of routes of administration for the antibodies and immunoconjugates 
may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or 
in the v^d of a resected tumor. It will be evident that the precise dose of the 
antibody/immunoconjugate will vary depending upon the antibody used, the antigen density 
on the tumor, and the rate of clearance of the antibody. 

Diagnostic reagents of the present invention may also comprise polynucleotide 
sequences encoding one or more of the above polypeptides, or one or more portions thereof 
For example, at least two oligonucleotide primers may be employed in a polymerase chain 
reaction (PGR) based assay to amplify lung tumor-specific cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for a polynucleotide 
molecule encoding a lung tumor protein of the present invention. The presence of the 
amplified cDNA is then detected using techniques well known in the art, such as gel 
electrophoresis. Similarly, oUgonucleotide probes specific for a polynucleotide molecule 
encoding a lung tumor protein of the present invention may be used in a hybridization assay 
to detect the presence of an inventive polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a 
polynucleotide molecule" means an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to the 
polynucleotide molecule in question. Oligonucleotide primers and/or probes which may be 
usefully employed in the inventive diagnostic methods preferably have at least about 10-40 
nucleotides. In a preferred embodiment, the oligonucleotide primers comprise at least about 
10 contiguous nucleotides of a polynucleotide molecule comprising sequence selected fi-om 
SEQIDNO: 1-109, 111, 113 115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171. 
Preferably, oligonucleotide probes for use in the inventive diagnostic methods comprise at 
least about 15 contiguous oligonucleotides of a polynucleotide molecule comprising a 
sequence provided in SEQ ID NO: 1-109,111, 113 115-151, 153, 154, 157, 158, 160, 162- 
164, 167, 168 and 171. Techniques for both PGR based assays and hybridization assays are 
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site of pcDNA3.1 (Invitrogen) and transfonned into ElectroMax E, coli DHIOB cells (BRL 
Life Technologies) by electroporation. 

Using the same procedure, a normal human limg cDNA expression library was 
prepared from a pool of four tissue specimens. The cDNA libraries were characterized by 
determining the number of independent colonies, the percentage of clones that carried insert, 
the averiage insert size and by sequence analysis. The lung squamous cell carcinoma library 
contained 2.7 x 10^ independent colonies, with 100% of clones having an insert and the 
average insert size being 2100 base pairs. The normal lung cDNA library contained 1.4 x 10* 
independent colonies, with 90% of clones having inserts and the average insert size being 
1800 base pairs. For both libraries, sequence analysis showed that the majority of clones had 
t full length cDNA sequence and were synthesized from mRNA 

cDNA library subtraction was performed using the above lung squamous cell 
carcinoma and normal lung cDNA libraries, as described by Hara et aL {Blood, 189- 199, 
1994) with some modifications. Specifically, a lung squamous cell carcinoma-specific 
subtracted cDNA library was generated as follows. Normal tissue cDNA library (80 ng) was 
digested with BamHI and Xhol, followed by a filling-in reaction with DNA polymerase 
Klenow fragment. After phenol-chloroform extraction and ethanol precipitation, the DNA 
was dissolved in 133 ^1 of HjO, heat-denatured and mixed with 133 ^1 (133 ^ig) of 
Photoprobe biotin (Vector Laboratories, Burlingame, CA). As recommended by the 
manufacturer, the resulting mixture was irradiated with a 270 W sunlamp on ice for 20 
minutes. Additional Photoprobe biotin (67 was added and the biotinylation reaction was 
repeated. After extraction with butanol five times, the DNA was ethanol-precipitated and 
dissolved in 23 ^1 HjO to form the driver DNA. 

To form the tracer DNA, 10 ^ig lung sqxiamous cell carcinoma cDNA lilMrary 
was digested with NotI and Spel, phenol chloroform extracted and passed through Chroma 
spin-400 columns (Clontech, Palo Alto, CA). Typically, 5 ^g of cDNA was recovered after 
the sizing column. Following ethanol precipitation, the tracer DNA was dissolved in 5 pil 
HjO. Tracer DNA was mixed with 15 ^il driver DNA and 20 fil of 2 x hybridization buffer 
(1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% sodium dodecyl sulfate), overiaid 
with mineral oil, and heat-denatured completely. The sample was immediately transferred 
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one sample of each tissue as described above), plus twenty other cDNA clones that were 
frequently recovered in lung subtractions I and U, as the driver DNA Gung subtraction HI). 
The normal liver and heart cDNA library contained 1.76 x 10* independent colonies, with 
100% of clones having inserts and the average insert size being 1600 base pairs. Ten 
additional clones were isolated (SEQ ID NO: 61-70). Comparison of these cDNA sequences 
with th^se in the gene bank as described above, revealed no significant homologies to the 
sequences provided m SEQ ID NO: 62 and 67. The sequences of SEQ ID NO: 61, 63-66, 68 
and 69 were found to show some homology to previously isolated ESTs and the sequence 
provided in SEQ ID NO: 70 was found to show some homology to a previously identified rat 
gene. 

B. Isolation of cDNA Sequences fi-om a Lung Adenocarcmoma Library 

A human limg adenocarcinoma cDNA expression library was constructed as 
described above. The library contained 3.2 x 10^ independent colonies, with 100% of clones 
havmg an insert and the average insert size being 1500 base pairs. Library subtraction was 
performed as described above using the normal lung and normal liver and heart cDNA 
expression libraries described above as the driver DNA. Twenty-six hundred independent 
clones were recovered. 

Initial cDNA sequence analysis fit)m 100 independent clones revealed many 
ribosomal protein genes. The cDNA sequences for fifteen clones isolated in this subtraction 
are provided in SEQ ID NO: 71-86. Comparison of these sequences with those in the gene 
bank as described above revealed no significant homologies to the sequence provided in SEQ 
ID NO: 84. The sequences of SEQ ID NO: 71, 73, 74, 77, 78 and 80-82 were found to show 
some homology to previously isolated ESTs, and the sequences of SEQ ID NO: 72, 75, 76, 
79, 83 and 85 were found to show some homology to previously identified human genes. 

Example 2 

DETERMINATION OF TISSUE SPECmCITY OF LUNG TUMOR POLYPEPTIDES 
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normal testes when the PGR was performed for 30 cycles. Lower level expression was 
detected in some normal tissues when the cycle number was increased to 35. Antigen LST- 
S3-13 (SEQ ID NO: 66) was found to be expressed in 3 out of 4 lung tumors, one breast 
tumor and both colon tumor samples. Its expression in normal tissues was lower compared to 
tumors, and was only detected in 1 out of 4 normal lung tissues and in normal tissues from 
kidney, byary and retina. Expression of antigens LST-S3-4 (SEQ ID NO: 62) and LST-S3-14 
(SEQ ID NO: 67) was rare and did not show any tissue or tumor specificity. Consistent with 
Northern blot analyses, the RT-PCT results on antigen LAT-Sl-A-lOA (SEQ ID NO: 78) 
suggested that its expression is high m lung, colon, stomach and small intestine tissues, 
including lung and colon tumors, whereas its expression was low or undetectable in other 
tissues. 

A total of 2002 cDNA fragments isolated in lung subtractions I, II and III, 
described above, were colony PGR amplified and their mRNA expression levels in lung 
tumor, normal lung, and various other normal and tumor tissues were determined using 
microarray technology (Synteni, Palo Alto, CA). Briefly, the PGR amplification products 
were dotted onto slides in an array format, with each product occupying a unique location in 
the array. mRNA was extracted from the tissue sample to be tested, reverse transcribed, and 
fluorescent-labeled cDNA probes were generated. The microarrays were probed with the 
labeled cDNA probes, the slides scanned and fluorescence intensity was measured. This 
intensity correlates with the hybridization intensity. Seventeen non-redundant cDNA clones 
showed over-expression in lung squamous tumors, with expression in normal tissues tested 
(lung, skin, lymph node, colon, liver, pancreas, breast, heart, bone marrow, large intestine, 
kidney, stomach, brain, small mtestine, bladder and salivary gland) being either undetectable, 
or 10-foId less compared to lung squamous tumors. The determined partial cDNA sequences 
for the clone L513S are provided in SEQ ID NO: 87 and 88; those for L514S are provided in 
SEQ ID NO: 89 and 90; those for L516S in SEQ ID NO: 91 and 92; that for L517S in SEQ 
ID NO: 93; that for L519S in SEQ ID NO: 94; those for L520S in SEQ ID NO: 95 and 96; 
those for L521 S in SEQ ID NO: 97 and 98; that for L522S in SEQ ID NO: 99; that for L523S 
in SEQ ID NO: 100; that for L524S in SEQ ID NO: 101; that for L525S in SEQ ID NO: 102; 
that for L526S in SEQ ID NO: 103; that for L527S in SEQ ID NO: 104; that for L528S in 
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Further analysis has demonstrated L529S (SEQ ID NO: 106 and 1 15), L525S 
(SEQ ID NO: 102 and 120) and L527S (SEQ ID NO: 104) are cytosleletal components and 
potentially squamous cell specific proteins. L529S is connexin 26, a gap junction protein. It 
is highly expressed in lung squamous tumor 9688T, and moderately over-expressed in two 
others. However, lower level expression of connexin 26 is also detectable in nonnal skin, 
colon, liver and stomach. The over-expression of connexin 26 in some breast tumors has 
been reported and a mutated fomi of L529S may result in over-expression in lung tumors. 
L525S is plakophilin 1, a desmosomal protein found in plaque-bearing adhering junctions of 
the skin. Expression levels for L525S mRNA is highly elevated in three out of four lung 
squamous tumors tested, and in nomial skin. L527S has been identified as keratin 6 isoform, 
type II 58 Kd keratin, and cytokeratin 13 and shows over-expression in squamous tumors and 
low expression in normal skin, breast and colon tissues. Notably, keratin and keratin-related 
genes have been extensively documented as potential markers for lung cancer including 
CYFRA2.1 (Pastor, A., et al, Eur, Respir. J., 10:603-609, 1997). L513S (SEQ ID NO: 87 
and 88) shows moderate over-expression in several tumor tissues tested, and encodes a 
protein that was first isolated as a pemphigus vulgaris antigen. 

L520S (SEQ ID NO: 95 and 96) and L521S (SEQ ID NO: 97 and 98) are 
highly expressed in lung squamous tumors, and L520S is up-regulated in normal salivary 
gland and L521S is over-expressed in normal skin. Both belong to a family of small proline 
rich proteins and represent markers for fiilly differentiated squamous cells. L521S has been 
described as a specific marker for lung squamous tumor (Hu, R., et al. Lung Cancer, 20:25- 
30, 1998). L515S (SEQ ID NO: 162) encodes IGF-P2 and L516S is an aldose reductase 
homologue and both are moderately expressed in lung squamous tumors and in normal colon. 
Notably, L516S (SEQ ID NO: 91 and 92) is up-regulated in metastatic tumors but not 
primary lung adenocarcinoma, an indication of its potential role in metatasis and a potential 
prognostic maricer. L522S (SEQ ID NO: 99) is moderately over-expressed in lung squamous 
tumors with minimum expression in nonnal tissues. L522S has been shown to belong to a 
class IV alcohol dehydrogenase, ADH7, and its expression profile suggests it is a squamous 
cell specific antigen. L523S (SEQ ID NO: 100) is moderately over-expressed in lung 
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nonnal human tissue cDNAs including lung, PBMC, brain, heart, kidney, liver, pancreas, and 
skin, (Clontech, Palo Alto, CA) were derived and submitted to a first round of PCR 
amplification. This library was subjected to a second round of PCR amplification, following 
the manufacturer's protocol. The resulting cDNA firagments were subcloned into the vector 
P7- Adv vector (Clontech, Palo Alto, CA) and transformed into DH5a E, coli (Gibco, BRL). 
DNA was isolated firom independent clones and sequenced using a Perkin Ehner/Applied 
Biosystems Division Automated Sequencer Model 373A. 

One hundred and sixty two positive clones were sequenced. Comparison of 
the DNA sequences of these clones with those in the gene bank using the EMBL and 
GenBank databases, as described above, revealed no significant homologies to 13 of these 
clones, hereinafter referred to as Contig 13, 16, 17, 19, 22, 24, 29, 47, 49, 56-59 . The 
determined cDNA sequences for these clones are provided in SEQ ID NO: 125, 127-129, 
131-133, 142, 144, 148-150, and 157, respectively. Contigs 1, 3-5, 7-10, 12, 11, 15, 20, 31, 
33, 38, 39, 41, 43, 44, 45, 48, 50, 53, 54 (SEQ ID NO: 1 15-124, 126, 130, 134-141, 143, 145- 
147, respectively) were found to show some degree of homology to previously identified 
DNA sequences. Contig 57 (SEQ ID NO: 149) was found to represent the clone L519S (SEQ 
ID NO: 94) disclosed in US. Patent Application No. 09/123,912, filed July 27, 1998. To the 
best of the inventors* knowledge, none of these sequences have been previously shown to be 
differentially over-expressed in lung tumors. 

mRNA expression levels for representative clones in limg tumor tissues, 
nonnal lung tissues (n=4), resting PBMC, salivary gland, heart, stomach, lymph nodes, 
skeletal muscle, soft palate, small intestine, large intestine, bronchial, bladder, tonsil, kidney, 
esophagus, bone marrow, colon, adrenal gland, pancreas, and skin, (all derived from human) 
were determined by RT-PCR as described above. Expression levels using microarray 
technology, as described above, were examined in one sample of each tissue type unless 
otherwise indicated. 

Contig 3 (SEQ ID NO: 1 16) was found to be highly expressed in all head and 
neck squamous cell tumors tested (17/17), and expressed in the majority (8/12) of lung 
squamous tumors, (high expression in 7/12, moderate in 2/12, and low in 2/12), while 
showing negative expression for 2/4 normal lung tissues and low expression in the remaining 
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Contig 22, (SEQ ID NO: 131) was shown to be expressed in most head and 
neck squamous cell tumors tested (13/17) with high expression in four of these samples, 
moderate expression in 6/17, and low expression in 3/17. Expression levels in lung 
squamous tumors were found to be moderate to high for 3/12 tissues tested, with negative 
expression in two normal lung samples and low expression in two other samples (n=4). 
Contig 2? did show low expression in skin, salivary gland and soft palate. Similarly, Contig 
24 (SEQ ID NO: 132) was found to be expressed in most head and neck squamous cell 
tumors tested (13/17) with high expression in three of these samples, moderate expression in 
6/17, and low expression in 4/17. Expression levels in lung squamous tumors were found to 
be moderate to high for 3/12 tissues tested, with negative expression for three normal lung 
samples and low expression in one sample (n=4). Contig 24 did show low expression in skin, 
salivary gland and soft palate. Contig 29 (SEQ ID NO: 133) was expressed in nearly all 
head and neck squamous cell tumors tested (16/17): highly expressed in 4/17, moderately 
expressed in 1 1/17, with low expression in one sample. Also, it was moderately expressed in 
3/12 lung squamous tumors, while being negative for 2/4 normal lung samples. Contig 29 
showed low to moderate expression in large intestine, skin, salivary gland, pancreas, toiisil, 
heart and soft palate. Contig 47 (SEQ ID NO: 142) was expressed in most head and neck 
squamous cell tumors tested (12/17): moderate expression in 10/17, and low expression in 
two samples. In lung sqxiamous tumors, it was highly expressed in one sample and 
moderately expressed in two others (n=13). Contig 47 was negative for 2/4 normal lung 
samples, with the remaining two samples having moderate expression. Also, Contig 47 
showed moderate expression in large intestine, and pancreas, and low expression in skin, 
salivary gland, soft palate, stomach, bladder, resting PBMC, and tonsil. 

Contig 48 (SEQ ID NO: 143) was expressed in all head and neck squamous 
cell tumors tested (17/17): highly expressed in 8/17 and moderately expressed in 7/17, with 
low expression in two samples. Expression levels in lung squamous tumors were high to 
moderate in three samples (n=13). Contig 48 was negative for one out of four normal lung 
samples, the remaining showing low or moderate expression. Contig 48 showed moderate 
expression in soft palate, large intestine, pancreas, and bladder, and low expression in 
esophagus, salivary gland, resting PBMC, and heart. Contig 49 (SEQ ID NO: 144) was 
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in SEQ ID NO: 170) has a two nucleotide deletion at the 3' coding region in comparison to 
SEQ ID NO: 160, resulting in a secreted form of the expressed protein. 

The fulMength cDNA sequence for contig 56 (SEQ ID NO: 148), referred to 
as L773P, is provided in SEQ ID NO: 171, with the predicted amino acid sequence in SEQ 
ID NO: 172. Subsequent Northern blot analysis of L773P demonstrates this transcript is 
diflfereritially over-expressed in squamous tumors and detected at approximately 1 .6 Kb in 
primary lung tumor tissue and approximately 1 .3 Kb in primary head and neck tumor tissue. 

Subsequent microarray analysis has shown Contig 58, also referred to as 
L769S (SEQ ID NO: 150), to be overexpressed in breast tumors in addition to lung squamous 
tumors. 



Example 4 
SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on a Perkin Ehner/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HPTU (0-Benzotriazole- 
N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence 
may be attached to the amino terminus of the peptide to provide a method of conjugation, 
binding to an inunobilized surface, or labeling of the peptide. Cleavage of the peptides from 
the solid support may be carried out using the following cleavage mixture: trifluoroacetic 
acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, tfxe 
peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be 
dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to 
purification by CI 8 reverse phase HPLC. A gradient of 0%-60% acetonitrile (contaimng 
0.1% TFA) m water (containing 0.1% TFA) may be used to elute the peptides. Following 
lyophilization of the pure fractions, the peptides may be characterized using electrospray or 
other types of mass spectrometry and by amino acid analysis. 



j 

! 

BNSDOaD: <WO 9947674A2 I > 



•uoi}U3AUi 9ip JO adoos puB luids 9qj mag SuT^BiAsp inoqjiM apBin aq Xbui suopBOijipoia 
snouBA 'aopBJjsnni P sasodmd sqj aoj uiajsij paquosap ussq ^ABq uopTOAin sxp jo 
s;u3uiipoqai3 ogioads qSnoq^iB 'iBq^ pa^BioajddB aq upvs. ii 'SuioSajoj sqj mojj 

9t^ 



86Z.S0/66Sn/X3<I 



wo 99/47674 PCT/US99/05798 

47 

CLAIMS: 

1 . An isolated polynucleotide molecule comprising a nucleotide sequence 
selected &om the group consisting of: 

(a) sequences provided in SEQ ID NO: 1-3, 6-8, 10-13, 15-27, 29, 30, 32, 
\ 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86- 

96, 107-109, 111, 113, 125, 127, 128, 129, 13M33, 142, 144, 148- 
151, 153, 154, 157, 158, 160, 167, 168 and 171; 

(b) the complements of sequences provided in SEQ ID NO: 1-3, 6-8, 10- 
13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69. 71, 73, 74, 
77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131- 
133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168 and 171; 
and 

(c) sequences that hybridize to a sequence of (a) or (b) under moderately 
stringent conditions. 

2. An isolated polypeptide comprising an immunogenic portion of a lung 
tumor protein or a variant thereof, wherein said protein comprises an amino acid sequence 
encoded by a polynucleotide molecule of claim 1 . 

3. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 2. 

4. An expression vector comprising an isolated polynucleotide molecule 
of claims 1 or 3. 

5. A host cell transformed with the expression vector of claim 4. 

6. The host cell of claim 5 wherein the host cell is selected from the group 
consisting of E. coli, yeast and manunalian cell lines. 
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13. A vacx^ine for the treatment of lung cancer comprising a polypeptide 
and a non-specific immune response enhancer, said polypeptide comprising an immimogenic 
portion of a lung protein or a variant thereof, wherein said protein comprises an amino acid 
sequence encoded by a polynucleotide molecule comprising a sequence selected from the 
group consisting of: 

\ (a) sequences recited in SEQ ID NO: 4, 5, 9, 14, 28, 3 1, 33, 50, 53, 56, 60, 
70, 72, 75, 76, 79, 83, 85, 97-106, 115-124, 126, 130, 134-141, 143, 
145-147 and 162-164; 

(b) sequences complementary to the sequences of SEQ ID NO: 4, 5, 9, 14, 
28, 31, 33, 50, 53, 56, 60, 70, 72, 75, 76, 79, 83, 85, 97-106, 1 15-124, 
126, 130, 134-141, 143, 145-147 and 162-164; and 

(c) sequences that hybridize to a sequence of (a) or (b) under moderately 
stringent conditions. 



1 4. A vaccine for the treatment of lung cancer comprising a DN A molecule 
and a non-specific inunune response enhancer, the polynucleotide molecule comprising a 
sequence selected fi*om the group consisting of: 

(a) sequences recited in SEQ ID NO: 4, 5, 9, 14, 28, 31, 33, 50, 53, 56, 60, 
70, 72, 75, 76, 79, 83, 85, 97-106, 115-124, 126, 130, 134-141, 143, 
145-147 and 162-164; 

(b) sequences complementary to the sequences of SEQ ID NO: 4, 5, 9, 14, 
28, 31, 33, 50, 53, 56, 60, 70, 72, 75, 76, 79, 83, 85, 97-106, 1 15-124, 
126, 130, 134-141, 143, 145-147 and 162-164; and 

(c) sequences that hybridize to a sequence of (a) or (b) under moderately 
stringent conditions. 

15. A method for inhibiting the development of lung cancer in a patient, 
comprising administering to the patient an effective amount of the pharmaceutical 
composition of claims 7 or 12. 
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from the group consisting of nucleotide sequences recited in SEQ ID NO: 1-109, 111,1 13, 
115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171 the complements of said 
nucleotide sequences and sequences that hybridize to a sequence of SEQ ID NO: 1-109, 1 1 1, 
113, 115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171 under moderately stringent 
conditions; and 

\ (b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby detecting lung cancer in the patient. 

25. The method of claim 24 ^dierein the binding agent is a monoclonal 

antibody. 

26. The method of claim 25 wherein the binding agent is a polyclonal 

antibody. 

27. A method for monitoring the progression of lung cancer in a patient, 

comprising: 

(a) contacting a biological sample obtained from the patient with a binding 
agent that is capable of binding to a polypeptide, said polypeptide comprising an 
immunogenic portion of a lung protein or a variant thereof, wherein said protein comprises an 
amino acid sequence encoded by a polynucleotide molecule comprising a sequence selected 
from the group consisting of nucleotide sequences recited in SEQ ID NO: 1-109, 111, 113, 
115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171 the complements of said 
nucleotide sequences and sequences that hybridize to a nucleotide sequence of SEQ ID NO: 
1-109, 111, 113, 115-151, 153, 154, 157, 158, 160, 162-164, 167, 168 and 171 under 
moderately stringent conditions; 

(b) determining in the sample an amount of a protein or polypeptide that 
binds to the binding agent; 

(c) repeating steps (a) and (b); and 

(d) comparing the amount of polypeptide detected in steps (b) and (c) to 
monitor the progression of lung cancer in the i)atient. 
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(c) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers, thereby detecting lung cancer. 

32. The method of claim 31, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQIDNO: 1-109, 111, 113, 115-151, 153, 154, 157, 
158, 160, 162-164, 167, 168 and 171. 

33 . A diagnostic kit comprising: 

(a) one or more monoclonal antibodies of claim 28; and 

(b) a detection reagent. 

34. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a polynucleotide molecxile comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 4, 5, 9, 14, 28, 31, 33,"50, 53, 56, 60, 70, 72, 75, 76, 79, 83, 85, 
97-106, 115-124, 126, 130, 134-141, 143, 145-147 and 162-164 the complements of said 
sequences, and sequences that hybridize to a sequence of SEQ ID NO: 4, 5, 9, 14, 28, 31, 33, 
50, 53, 56, 60, 70, 72, 75, 76, 79, 83, 85, 97-106, 115-124, 126, 130, 134-141, 143, 145-147 
or 1 62- 1 64 under moderately stringent conditions; and 

(b) a detection reagent. 

35. The kit of claims 33 or 34 wherein the monoclonal antibodies are 
immobilized on a solid support. 

36. The kit of claim 35 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

37. The kit of claims 33 or 34 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 
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sequences that hybridize to a sequence ofSEQ ID NO: M 09, 111, 113, 115-151, 153, 154, 
157, 158, 160, 162-164, 167, 168 or 171 under moderately stringent conditions; and 

(c) detecting in the sample a polynucleotide sequence that hybridizes to 
the oligonucleotide probe, thereby detecting lung cancer in the patient, 

\ 43. The method of claim 42 wherein the oligonucleotide probe comprises 
at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a sequence 
selected from the group consisting of SEQIDNO: 1-109, 111, 113, 115-15L 153, 154, 157, 
158, 160, 162-164, 167, 168 and 171. 

44. A diagnostic kit comprising an oligonucleotide probe specific for a 
polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of a 
lung protein or a variant thereof, said protein comprising an amino acid sequence encoded by 
a polynucleotide molecule comprising a sequence selected from the group consisting of: 
nucleotide sequences recited in SEQ ID NO: 1-109, 111, 113, 115-151, 153, 154, 157, 158, 
160, 162-164, 167, 168 and 171; the complements of said nucleotide sequences; and 
sequences that hybridize to a sequence of SEQ ID NO: 1-109, 111, 113, 115-151, 153, 154, 
157, 158, 160, 162-164, 167, 168 or 171 under moderately stringent conditions. 

45. The diagnostic kit of claim 44, wherein the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a 
sequence selected from the group consisting of SEQ ID NO: 1-109, 111, 113, 115-151, 153, 
154, 157, 158, 160, 162-164, 167, 168 and 171. 

46. A method for treating lung cancer in a patient, comprising the steps of: 

(a) obtaining peripheral blood cells from the patient; 

(b) incubating the cells in the presence of at least one polypeptide of claim 
2, such that T cells proliferate; and 

(c) administering to the patient the proliferated T cells. 
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(b) administering to the patient the incubated antigen presenting cells. 

55. A method for treating lung cancer in a patient, comprising the steps of: 
(a) incubating antigen presenting cells in the presence of at least one 

polynucleotide of claim 1 ; 

\ (b) administering to the patient the incubated antigen presenting cells. 

56. The method of claims 54 or 55 wherein the antigen presenting cells are 
selected from the group consisting of dendritic cells and macrophage cells. 

57. A composition for the treatment of lung cancer in a patient, comprising 
antigen presenting cells incubated in the presence of a polypeptide of claim 2, in combination 
with a pharmaceutically acceptable carrier. 

58. A composition for the treatment if lung cancer in a patient, comprising 
antigen presenting cells incubated in the presence of a polynucleotide of claim 1, in 
combination with a pharmaceutically acceptable carrier. 
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<220> 

<221> misc__f eature 

<222> (1) . . . (346) 

<223> n = A,T,C or G 

<400> 3 

ttgtaagtat acaattttag aaaggattaa atgtcattga ccattttact gaatactgca 60 

cac.cctcacc atacaccatc caccttccaa taacacttaa tccttcctaa aattgtaagt 120 

acaiaattgt actttctttg gattttcata acaaacatac catagactgt taattttatc 180 

gaagcctccc caatggaatg agccactctt gccttgtgct cttgaggtca ccttcgcttt 240 

gacttccaac aatttgatca tatagtgttg agctgtggaa acctttaagt ctactctata 300 

gcaataattt ctactnnnag annccnggnn naaaannann annaaa 346 

<210> 4 

<211> 372 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (372) 
<223> n = A,T,C or G 

<400> 4 

accagcctca tcaccccaga attacgctct tgcacccgtg cggctgggcc tcteagccgc 60 

tggctcggtr tggtcttttg aactggtang cagggcggtc cacagttcta atgtaagcac 120 

cctcctctcc aagctgcgcc ccgcggggac aaccattccc tgaacaccag agaggaaggc 180 

agcccaagcc gtcgaaaaga ctattgccta uttttgcttu caaagaccca cttgacgtca 240 

cgcggacagt gcacgtgcct tacgccacac cttgttttct aggaagaagg ggatgcnggg 300 

aaggancggg tgccttgcga cggacaaaac gnccaaataa cacaccctta catcctgaaa 360 

aaaacaaaac aa 372 

<210> 5 

<211> 698 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (698) 
<223> n = A,T,C or G 

<400> 5 

accagtanga tagaaacact gcgtcccgag agtaaggaga gaagctacta ttgattagag 60 

cctaacccag gttaactgca agaagaggcg ggacaccttc agcttcccac gtaaccgtac 120 

gcataaagcc aatgtagtcc agttcctaag accatgtccc aagctaactg aaccccactt 180 

caatacacac tcatgaaccc ctgatggaac aacaacaggc ccaagcctgt ggtatgacgt 24 0 

gcacacttgc cagacccaga aaaaatacta ctctcataaa cgggtgggag cattttgggc 300 

gacaacctac ctcgctcggc tgagtgaagg aatgacactc atacnttcat ctattccatg 360 

gacacttagt cagcgctctt tatataccag gcacgatgcc gagcgacact cctgtgcaca 420 

cntccaaacn ctngtncngt cgctgcacac acccgaaatc ccacatcaag ancttcccaa 430 

nacgangtcc ctggcctttc cacgccacct gatcngccaa ngaccccacc cccgcntgtc 540 

ccaaaaccnc ccnctnnang gctagacngg accccccccc tcccttcccg aanaacnaag 600 

cgtgngaaga nanccncncn cccccctnch rncnncccng ccngccnnr.c cncncgcngg 660 
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<220> 

<221> misc_f eature 
<222> (1) . . . (689) 
<223> n = A,T,C or G 

<400> 8 

accagcatcc aggaatgaac agtaaaagag gagcagttgg ctacttgatt acaacagagt 60 

aaatgaagta ccggatttgg gaaaacctgg ttttattaga acatatggaa tgaaagccta 120 

cac^cagcat tgcccactta gccccctgaa tcaacagagc ccaatcgaga caaacccccg ISO 

gcaacaggaa acccaaggga gaaaaagtaa gcaacttggg' ccaggatgag ctgaccccct 240 

tagagcaaag ganagacagc ccccattacc aaataccatt tttgcctggg gcttgcgcag 300 

ctggcagtgt tcccgcccca gcatggcacc ttatngtctt gacagcaact ccgccgaatt 360 

ttcaccaacc tatcacttga aatcataata cagcctgccc gctcgctgtn tccaggccgc 420 
gacatacntt cctagtggtc tgactccnaa aacaaatnag gttcantcct ctccccccnn - 480 

cnncnctncc nnccnctcnn cnntcccccc cnctcngtcc tccnnnntcn gggggagccn 540 

cccccncggn ggacccccct tcggtccccc agtggaggtt nacggcccct ggnnttatcc 600 

nggccntann c-cccccgcn nnaaatgntt ccccccccca nccccnccac ctcaanccca 660 

aagcccaagc ttnuaccctg ggggccccc 539 

<210> 9 

<211> 674 

<212> DNA 

<213> Homo sapien 



<:220> 

<221> misc_feature 
<222> ( 1) . . . (674) 
<223> n = A,T,C or G 



<400> 9 

gcccactctc ccttgagtgt actgtcctac tgtgcactct gcttttcaac tttctagata 60 

taaaaaatgc ccgccccata gtggagcaag agcccacaca ' cccaaggcag caagacaact, 120 

gaaaaaagcg aggctttttt gccaccttgg caaaggccag ctcaccgcta tagaactgct ISO- 

ataagcctga agggaagcag ctacgagacc ttccattttt ctcagctctc ccaataggcc 240 

ccctcatgga aaaaggcttc ccgtaacaat tttcacctaa tgaattagca gtgtgattat 300 

ttctgaaata agagacaaat tgggccgcag agtcttcccg tgattcaaaa taaacaaccc 360 

aaagttttgt tcggtctcca ccaaaggaca tactctaggg ggtatgtcgt tgaagacact 420 

caaaaacatt agccgttctg tctttcaatt tcaagctatt tcggagactg cctccacgtg 480 

agtcaattac tttgctctgg aactagcatc attgtcatta tcaccacatc ctgtcatcat 540 

catctgaata atactgtgga cttccccctc tgcttgcacc ttcttttgac tcctctggga 600 

anaaacgtca aaaaaaaagg ccgatccact cngcaaggnc catctaatca ctgcgctgga 660 

aggacccnct gccc g74 

<210> 10 

<211> 346 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (346) 
<223> n = A,T,C or G 



<400> 10 
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60 



actagtctgc tgacagaaag cactacacac cctattgttt ctttctttcc aaaatcagcc 

ttctgtctgt aacaaaaatg tactttatag agatggagga aaaggtctaa tactacatag 120 

cctcaagcgt ccctgtcatc gttcaagtgc actctctgta acagaaacat attcggaatg 180 

tctttcttct cccctcataa atcgcaaccc ccgaaacacc gctgcttcaa aaagtcccac 240 

tgtcagatta tattatctaa caattgaata ctgtaaatac actcgtctta cctctcaaca 300 

aaagggtact ttcctatcan nnagnngnnn gnnnnataaa anaaaa 34$ 



<2L0> 11 
<211> 602 
\ <212> DNA 

<213> Homo sap i en 

<400> 11 

accagcaaaa agcagcattg ccaaacaatc cctaattttc cactaaaaat ataatgaaat 
gacgctaagc ccttcgaaaa gttcaggtca aacctaccgt cgttagatta acgtacttgc 
tgcccccctc tacctggaat gtggcattag ctcttttatc ccaaccctct ttaactctta 
ttcaacccca tgacttaagg ttggagagcr aaacactggg acttctggat aacagactga 
cagtrtcgca taaccataat cggcattgca catagaaagg atatggctac cttctgtcaa 
accTigcaccc ccraaacacc aaaaaaggga aatgaagcca caaatcaacc tttgcacaar 360 
ctgnctgaaa cacgagcttc attcgcccaa caccagggcc ttgccccctt tctgcaagtc 
tcctgggacc ctgtgcagaa ctgccctcac -aaacaccaa acagttaagc ccattctcrg 



aa 



<210> 12 

<211> 635 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (685) 
<223> n = A,T,C or G 

<400> 12 



60 
120 
180 
240 
300 



420 

_ ^_ 480 

gcac-agcca caaattcggt ctcatattcc actcaacaat ncaaacaaac cgaaacactc 540 
ctagacggtc cactcctgtt catataaaaa caaaacttga cttccaaaaa aaaaaaaaaa 



600 
502 



actagccctg tgaaagcaca actgaaggca gaaagtgtca ggattttgca tctaacgttc 60 
atcaccacgg tattgatgga cctaagaaaa taaaaattag actaagcccc caaacaagct 120 
gcatgcattt gtaacatgat tagtagattt gaatatatag atgtagcatn ccgggtatct 



_ , — 180 

aggtgtccta tcactacgta aaggaattaa agcaaaggac tttgtagttg tttccattaa 240 

atatgcacat agtagagtgc aaaaatatag caaaaatana aaccaaaggc agaaaagcat 300 

cttagatatg ccttaatnta nnaactgtgc caggtggccc tcggaacaga cgccaggcag 360 

agaccagcgc ctgggtggtg cctccccttg cctgcccccc tgaagaactt cccccacgcg 420 

angtagtgcc ctcgtaggcg tcacgtggan tantggganc aggccgnncn gtnanaagaa 480 

ancanngcga nagtttcncc gtngangcng aactgtccct gngccnnnac gcccccanaa 540 

cntncccaat ngacaatcga gcctccnnnc rccngnaacc cngccgnnnn cnngcccnnc 600 

cantnrgnca accccgcgcc cggaccgctc tcnnnccgct ctcncncnaa ngggntttcn 660 

cnnccgccgt cncnnccccg cnncc 635 

<210> 13 

<211> 694 ; 

<212> DNA 

<213> Homo sapien 

<220> 
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ccccngcngn angcngtgcg cnncangncc gngccgnncn ncaccctccg nccnccgccc 600 
cgcccgctgg gggcccccgc cncgcggntc antccccncc cncncgccca ctntccgntc 660 
cnncnctcnc gcccngcgcn cgcccnccnc ccccccc 697 

<210> 18 

<211> 670 

<212> DNA 

<213> Homo sapien 

\ <220> 

<22l> misc_feature 
<222> (1) . . . (670) 
<223> n = A,T,C or G 



<400> 18 

ctcgtgtgaa gggtgcagca cctaagccgg agcggggtag aggcgggccg gcacccccz- 60 

ctgacctcca gtgccgccgg cctcaagatc agacacggcc cagaacttga acgacccggc 120 

gggacggctg cccgccgggc cccggggcac gggcacggcc ccgaagctgt tgctgcgggc 130 

cggcgccgtg gccracggcg cgcgcgaacc tgtgtccacc gcggaaggcg ggcncagagc 240 

cacctiucttc aaccggatcg gtggagtgca caggacacta tcccgggccg anggcctcca 300 

cttcaggatc ctcggctcca gtaccccanc acctacgaca cccgggccag acctcgaaaa 360 

aatcticctcc ctacaggctc caaagaccta cagacggcga acatccccct gcgagccccg 420 

cctcgaccaa cgctcangaa cttcccaaca ngctccancg cccaagggcc ggactacnaa 480 

gaacgantgc tgccgtccat cgtcacgaag cgcccaagaa tttnggtggc caagctcaac 540 

gncctracnn ctgatcnccc agcggggcca agttancccc ggccgatccc cgggganctg 600 

acnnaaaagg gccaaggact tccccccacc ccggacaatg tggccntcac aaagcrcaac 660 



<210> 19 

<211> 606 

<212> DNA 

<213> Homo sapien 



<220> 

<22l> misc_feature 
<222> (1) . . . (606) 
<223> n = A,T,C or G 



<400> 19 

accagtgcca acctcagctc ccaggccagt cctctgaatg tcgaggagtt ccaggatctc 60 

tggcctcagt cgtccttggt tattgatggg ggacaaattg gggatggcca gagccccgag 120 

cgccgccttg gctcaactgt ggctgatttg tccgtgcccg gaaagttcgg catcactcgc 180 

ccaggctgtg ccctggaaag tactacagcc atcctccaac agaagtacgg actgctcccc 240 

ccacatgcgt cctacctgtg aaactctggg aagcaggaag gcccaagacc tggtgccgga 300 

tactatgtgc ctgtccactg acgactgtca aggcctcact cgcagaggcc accggagcca 360 

gggcactagc ccgactctta aggcagtgcg tctttctgag cactgtagac caagcccccg 420 

gagc-gctgg cttagcctcg cacctgggga aaggatgtaz tcatctgcat tttcatacac 480 

cagccaaaag ctgaatggaa aagttnagaa cactcccagg cggccttact ctaacaagct 540 

tccuczigtct gccccgtttc tcaatcgaaa agtcatcaaa caacagatct agaacctaat 600 
gagacc ' 

<210> 20 

<211> 449 

<212> DNA 

<213> Homo saoien 
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<222> (1) . . . (669) 
<223> n = A,T,C or G 

<400> 23 

actagtgccg tactggctga aacccctgca ggaccaggaa gagaaccagt tcagactttg 60 

tactctcagt caccagctct ggaattagat aaattccttg aagatgtcag gaatgggatc 120 

tatcccccga cagcctctgg gctgcctcgg ccccagcagc cacagcagga ggaggtgaca 180 

tcacctgtcg tgcccccctc tgtcaagact ccgacacctg aaccagctga ggtggagact 240 

cgcawgtgg tgctgatgca gtgcaacact gagtcggtgg aggagggagc caaacaccac 300 

ctgacacctc tgctgaagtt ggaggacaaa ctgaaccggc acccgagctg tgacctgatg 360 

ccaaacgaga atatccccga gttggcggct: gagctggcgc agctgggctt cactagtgag 420 

gccgaccaga gccggttgac ttctctgcta gaagagactt gaacaagttc aaccttgcca 480 

ggaacagtac cctcaactca gccgctgtca ccgcctcccc ttagagctca cccgggccag 54 0 

gccctgacct gcgctgtggc tgtccrggac gcgccgcacc ctctgccctt ccccccagtc 600 

agcattacct gtgaagccct tccccccttc attacccagg anggctgggg gggctccccg . 660 

nctctaacc gg5 

<210> 24 

<21i> 442 

<212> DNA 

<213> Homo sapien 

<:400> 24 

accagcacca tcctgacaga ggatacacgc ccccaaaacg rttgtcacca cacctaaaaa 60 

tcactgccat catcaagcat cagtttcaaa atcacagcca tccatgatnc acttctucca 120 

gatgaccatc attattctag tcctttgaau ttgtaagggg aaaaaaaaca aaaacaaaaa 180 

cccacgatgc actcttctcc agcacatcag acttcaaatt gaaaattaaa gacatgctae 240" 

ggcaatgcac ttgccagtac cacacacttt ggtacaacaa aaaacagagg caagaaacaa 300 

cggaaagaga aaagccttcc cttgttggcc cttaaaccga gtcaagatcr gaaatgtaga 360 

gatgatctct gacgatacct gtatgttcct actgcgtaaa caaaactgcc ggcacgaaac 420 

gacctaaaaa aaaaaaaaga aa 442 

<210> 25 

<211> 656 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (656) 
<223> n = A,T,C or G 

<400> 25 

tgcaagtacc acacactgtt tgaactttgc acaaaaagtg actgcaggat caggtgacag 60 

ccccggaatg tacagtgtct tggtgcacca agacgcctcc taaaggctga cataccrtgg 120 

accctaatgg ggcagagagt acagccctag cccagtggcg acacgaccac tccctttggg 180 

aggcctgagg cagaggggag tggtatgtgt tttcccagcg gaagcagcac atgagcgggc 240 

gacaggatgc tagataaagg ctctagtcag ggtgtcattg tcatccgaga gactgacaca 300 

cccccagcag ctggcaaagg ggcgccggan gccatggagg anccccagaa acaccagcac 360 

gggctgatct gatcacttcc tggcaccccg ctcaccttta cgggaagtcc cattagangg 420 

atgggacagt ttcccatatc cccgccgcgg agctctggaa cactccctaa atttccccct 480 

actaaaaatc actgccctaa ccacactccc ccctcgaagg aatagaaacg gaacctrcrc 540 

cgacacanct ctcggcatgg ggagccagcc acaaatgana atctgaacgt gtccaggttt 600 

cccctganac tcatccacac agaatcggc" aaaccccccc ttggaacaag gaaaaa 656 
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ggaaggggcg aaagacacgt gggataaact gagaaaagaa nccaaaaacc tcaacatcca 120 

aggcagctta ttcgaactcr gcggcagcgg caacggggcg gcggggtccc tgcccccggc 180 

gtccccggcg cccctggtgc ctccctcggc agcttcagcg acctgnctct ccccccgagc 240 

gtggggccag ccccccccgc ggcgcccacc cacnctcact ccacgctccc ggaaatcgag 300 

aggaagacca ctagttcttc ggggacgtcn gtgactctct gcgatgctga aaaacaccca 360 

tacagggaat gtgggaaatc ccganccctt tnttacntcg tntgatttct tgtgctttac 420 

ttgccaaaac gtcaccaacc agtgaccaac cnagcacagc caaaaatcgg acntcngctt 480 

tagtccgtct tcacacacag aacaagaaaa cggcaaaccc accccacctt tnanctcnac 540 

taci;actaan ttctctctgt rgggcaaaag aatctcagga acngccctgg ggccnccgca 600 

ccanagttaa ccnagccagt tncatgaaaa atgatgggct ccncctcaat gggaaagcca 660 

agaaaaagnc ^-jq 

<210> 29 

<211> 551 

<212> DNA 

<213> Hcmo sapien 



<220> 

<221> mi3C_feature 

<222> (i) . . . (551) 

<223> n = A,T,C or G 



<400> 23 

accagtcctc cacagcccgc gaatccccct 
agatctcagc gttcagccac ctcacccacg 
cccccrccag ccactgacgg gaaagcactc 
ccagcaccag aggzgcccga cgccgcacac 
ccccccgacc taagrcgcgg tccagaagct 
cgcaatgaat gtcccagggc agaaaaagag 
aaagcaatag ccgacgggaa gaggagctcc 
aaaagcgaaa trgggaagac aaaagctcaa 
aggaaggaag agagaagaga gacnaagatc 
aaaaaanaaa a 

<210> 30 

<211> 684 

<212> DNA 

<213> Homo sapien 



agacctctca agcatagtga gcggagaaga 60 

cccgatgact ctgcagaaaa ggctccctcc 120 

tccatcagct ctcaaaanca gcaagaatct 130 

tngccacccg agaagctggg accctgcccc 240' 

acagcaccgg tagccccaga tLcccc"tac 300 

gacacncaga tgctcccaaa czctzczzcc 360 

agcagcagca ggaacatcga aaacagaaaa 4 20 

cagcatttgg caaggagaaa aganaagatg 480 

nctacggacc gnnncggaag aagaagaagn 540 

551 



<220> 

<221> misc_f eacure 
<222> (1) . . . (684) 
<223> n = A,T,C or G 



<400> 30 

accagcccca tctggaaaaa gcccgggtcg gaagaagccg tggagagtgc gtgtgcaacg 60 

cgagactcat tccttggaag cacccctggc aaaaacgcag ctgagtacaa ggccaccact 120 

gcgatagaac ccggaccgcr tcttgagata atagagatgc cgcagtctga agagacttcc 130 

agcacctccc agttgaacga atcaacgatg gcctccgagc caactncacc ggctcaggaa 240 

ccacgagaga tgaccgcaga cgtaaccgag cttaaaggga aatccctcac caacrtagaa 300 

ggtggtgaca ttrgtgaaga gtcttcctac aaagcaactg tcacgccgac tacgaaagaa 360 

aaacgccccc gttgctggaa gcatacagcg ggagccttca gacacactgt gcccrcgacg 420 

tgcagaagtc gccagcggga aaatagcatc aacagcccac ccgagcaaga accctcctga 4 80 

cagcactggg ctagaagccc ggacggacca cttacaacae aggaaagaaa gccaagaacr 540 

aggcnacgag cggatgagca aacggcggan gacggggaac ccaaaccaga accacggaag 600 
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<220> 

<221> tnisc^f eature 
<222> (1) . . . (673) 
<223> n = A,T,C or G 



<400> 33 

actagtcatt tactttcccc cgcttcagaa ggtttttcag actgagagcc caagcatact 60 

ggatctgttg ttccttttgg gtcccacccc accagcgtgc acagcggcag aaattataaa . 120 

gaaggtcgaa aggagcaggg aaaagatcca gaagcatgct agttcgacac catcatcctt 180 

tctegaagta cgatgcatac cgcatcactt tatttgcaaa ctaggaaccg cagtctgagg 240 

atcacctaga agggcaagcc caagaggata tgaagatttg agaactttcc aactatccac 300 

tgaccaaaaa tgaacattaa tgtcnaagac ttaagacttt aacctgccgg cagtcccaaa 360 

tgaaaccatg caactctgac atcatatccc ttgatttaaa ttgggccttt gtgactgant 420 

gaaactctac aaagcatatg gtcagttatt tnattaaaaa ggcaaaaccc gaaccaccct: 480 

ccgcacctaa agaagcctaa cagcacaaat acctatctat cttagacgga tntatccntr - 540 

tncaccccta aatactgtac tacctacggn nggcggggct ctcttaccaa tacacaaacn 600 

aactcaccat tccaanggca ctctatttgg gtccagaagc tgactccaag nantgcatat 660 

ttcgccactg tnc 573 

<210> 34 

<211> 684 

<212> DNA 

<213> Homo sapien 



c220> 

<22l> misc_feacure 
<:222> (1) . . . (684) 
<223> n = A,T,C or G 



<400> 34 

actagtttat tcaagaaaag aacttactga ttcctctgtc cccaaagcaa gagtggcagg 60 

tgaccragggc cggtgtagca tccggttcct ttagtgcagc caaccgcact tgccactgat 120 

gaccaaggag gaaaccacta agacatttga gaagcagcgg catgaacgct cttggacaag 180 

ccacagccct gagcctcaac cctgtagtct gcacacaaga acgagctcca cccccccccc 240 

etcaggagga accrgcgcgg atagattggc tggacctttc aatggctc-g ggtcgcaagt 300 

gggcactgcc atggctgggt atggagcgga cagccccagg aaccagagcc ccagcccggc 360 

cgcccggttg gaaggtacag gtgttcagca ccttcggaaa aagggcataa agtngcggcg 420 

gacaactctc agtccaagaa gaacgcattg accattgctg gctatctgcc tncccagtan 480 

gaatcggatn cattttcgac cangatnnct ctnctatgcc ttnttgcaac gaaatcaaat 540 

cccgcattat ctacaagtgg tatgaagtcc cgcnnccccc agagaggcrg ttcaggcnac 600 

gtcttccaag ggcagggtgg gttacaccat tttacccccc ctctcccccc agatcatgna 660 

cncagaagga atctnttccc tccc 684 

<210> 35 

<211> 614 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_f eature 
<222> (1) . . . (614) 
<223> n = A,T,C or G 



<400> 35 

acragtccaa cgcgccngcn aatattcccc nggtagccca cccccccacc cccaaacar. 60 
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tnanaccaac agcnacngan natnggggct ccccngggcc ggngcaacnc tcccncaccc 
cggcgcnggc ctccggtgnt gtcccccntc aacnaattcc naaanggcgg gccccccngc 
ggacccctcn tcgtcccctc c 

<210> 38 

<211> 687 

<212> DNA 

<213> Homo sapien 

\ <220> 

<22l> misc_f eacure 
<222> (1) . . . (637) 
<223> n = A,T,C or G 



<400> 38 

canaaaaaaa aaaacatggc cgaaaccagn aagctgcgcg atggcgccac ggccccccci 60 

cccccggcct gtgcccggaa ggtccccccc cgaggcgccc cggctcccgc aagcggagga 120 

gagggcggga cncgccgggg ccggagccca naggccccgg ggccgct^ug ctcccccgcc 180 

accgcaaggg cggcgccaac ctnaggcccc cccgcaaagg cccccnangc ggnggcggcg 240 

gggggctgtg anaaccgcaa aaanaacgcc gggcgcgcng cgaacccgcc cacccccgcg 300 

aaggananac ttccacagan gcagcgtctc cacagcccan agccacattr ctaccgcgac 360 

gcaccccagn aagcccccgn cggggaagcc caccgctgtc aaaaaancrc ttcgctccac 420 

cggcgcacna aggggangan ggcangangc tgccgcccgc acaggtcatc tgaccacgtc 480 

gcccgcccca ntccgccttt gtgaatcccc acttcgttca accccacccg ccgccccctc 540 

cccctcgcgc ctccctccna ccctaanaac cagcctcctr: tacccnaLng canccnctrt: 600 

gcncnngcng aaaccaactc ggcccnccgg aacccccrnc ccgtggcaac tgctnaaaga 660 

aacrgccgct ccgnccaccg cngcccc 587 

<21G> 39 

<211> 695 

<212> DNA 

<213> Homo sapien 



<220> 

<22l> misc_feacure 
<222> (1) . . . (695) 
<223> n = A,T,C or G 



<400> 39 

actagtctgg cctacaatag tgtgattcat gtaggacttc tttcatcaac tcaaaacccc 60 

tagaaaaacg catacagatt atataagtag ggataagatc tctaacactt ctgggctctc 120 

tgacccctgc gccagactgt ggaaagggag tattattata gtatacaaca ccgctgttgc 180 

cttatcagtt ataacatgac aggtgctgaa ttgtgattca caatttaaaa acactgtaac 240 

ccaaactttt tttttcaact gtagaccacg catgtgaacg ttaacgtcaa ttcgctcaan 300 

gctgtcatgg gtagaaaaaa ccacacgcct taaaattcta aaaagcaggg cccaaaccta 360 

ttagtctaaa attaggggta tgtttccagc ctgttactaa ncggttacag ctccgtccag 420 

aanaaatcna ngaacangac ttngaaantc aagntgacac cactcnccag tgacttgtca 480 

atttgaaatc anacacggca cctcccgtcc tggtnctatc ggnnccrgaa -ccaancngg 540 

ntccaaatcc tntcggaaac ngtccntcca accttcccac nanatct-iat: tttt^cactc 600 

cggaacggcc ctatctaang ttaaaagggg ggggnnccac naccacrcnr gaacaaaacc 660 

naatatatat ccctggtccc ccaaaactta aggng 695 

<210> 40 
<211> 674 
<212> DNA 
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OEr aaBOBBBODB BBBODOBDDO DBBBO^C-aDD OODBBOBOBb OOOqOOBQUB BBUB66B23B 

QZf ■ -5:ioaaBaBD oaaB^BBuao aBoaoaaoBq oaaBDTiBBBa a^BaBB^oBO aBBB^BBBaa 

09e bbbbb^bBbB Bq:iD:iqH:ioo iiaaaBBOB^B BaBz^ooBeBB 5:i^q3BBaza oaaBBoaBBq 

OOe Z-3B6oaaaB aBaOaaoaoa aoaaqBaB^B BfiBqaOBBBo aBBBOBDDBa BBDBBaBaaB 

OfZ BC^BBaoBaB BBfiaoafioBq BBoB^BBBBa aoBOBafiB^B BB^BBBaaoa BD^DBBaaoa 

081 ^ruBaaaaao BBaoqaBaoB Bbbb^Bb^bb Bo^BBBBaaB BaBCOfiBaBu bbbbBbdezj^ 

OZZ oBaBaoaaBo BfiBB^BaaBB BBafiBoBqao BaBBoDoaBB BBfiBBBBfiBa BBqs^BaqBa 

09 aZiOBDoqqqB BfiqBqBqqqB oqqDBBqaoo B3BqD63q66 qBeBBfiaaBB oqfiBq&BqoB 

Ofr <00^> 



0 :to O'x'V = u <ZZZ> 
i^LS) (T) <ZZZ> 
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actagcgctg aggaatgtaa acaagtttgc tgggcctcgc gagacttcac caggttgctt 60 

cgatagctca cactcctgca ctgtgcctgt cacccaggaa cgtctttctt aatcagaaga 120 

caggaagaaa acaaaaacca gactgtgtcc cacaatcaga aacctccgtt gcggcagang 130 

ggccctcacc gccaccaggg tgccccgcca gacagggaga gaccccagcc ctctgaggcc 24 0 

atcccgaaga attcccgttt gggggttgcg aaggaaaacc acccggattt aaaaagacgc 300 

cgttccctgc ccgcgtngtn gggaagggac tggtctcctg gtgaatttct taaaagaaaa 360 

atatuttaag tcaagaaaaa aaaaaaaaa 389 

<210> 43 
V <211> 279 
<212> DNA 
<213> Homo sapien 

<400> 43 

accagtgaca agctcccggc ctcgagatgt cttctcgtta aggagatggg cctcttagag • 60 

gcaaaggata aaatgaacga gtcctgtcat gattcactat cctagaactt gcatgaczc:: 120 

caccgtgcta gctctttgaa tgtccctgaa attttagacc ttccttgtaa acaaacaaca 130 

cgtccctatc attgcataaa agctgttacg rgcaacagcg cggagacccc tgcctgactc 240 

aataaaacac ccaaacactg aaaaaaaaaa aaaaaaaaa 279 

<210> 44 

<211> 449 

<212> DNA 

<213> Hcmo sapien 

<:220> 

<22l> misc_feature 
<2'22> (1) . . . (449) 
<223> n = A,T,C or G 



<400> 44 

actagcagca tcttttccac aacgtcaaaa ttgcagaagc agcttatcat taaaaaacaa 60 

caacaacaac aacaacaata aatcctaagc gtaaatcagc catcctaccc cctaccaagg 120 

atatcagccc gtttttcccc ttttctctcc cgggaataac cgtgggcccc tccccaaaut 180. 

tccacagcct ccctcctctt ctcatgcttg agcttccctg cttgcacgca cgcgttgtgc 240 

aagancgggc tgtttngcct ggantncggt ccnagcggaa ncacgccntc cctcgctacc 300 

gttggaagaa acccaaacct tcnancccta ggtgtcncca ttttgtcaag tcaccactgt 360 

atttttgtac tggcatcaac aaaaaaagaa atnaaatatt gctccatcaa actttaacaa 420 

aacrttaaaa gggaaaaaaa aaaaaaaaa 44 9 



<210> 45 

<211> 559 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (559) 
<223> n = A,T,C or G 



<400> 45 

accagcgtgg gggaatcacg gacacttaaa gtcaacccgc gaaataatcc ttt::atC3ca 60 

cacccactga agtcttitgag ccccagagag ccacrctacg ccaaacaccc caagcacrct 120 

tcgagagccc agcatcacac caacatgccc gtgcagctca aaccgaagtc cgcaggcaaa 130 

cccgaagcrc tgctcgtcac tcaaacagai: gaaggcaaga gcaccgc::ac tcgactaari 240 
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OZl TzaBafiqBaB afiqBqBqBzq aBqBaB3B3B BBqoBBBBaa aBqasBOBBq oooqqBBBoq 
09 22zzc-3qBB zzqBOoaaBa BZ3B30BB0B asBBaBoaBq oBBaBcoBzB bzcuuBbzdb 

9^ <OCv> 

O JO 3'X'V = u <£:^3> 
iXZL) - • (T) <23?> 
3jnqB5I DSTUl <T32> 

uatdBS oiJUOH <ZXZ> 

Mm <ziz^ 

IZL <ZXZ^- 
9fr <0T3> 

bbBBbbbbb bbbbbbbbbb 

0^9 BBiBBBqB^B BzqauBqBqB BBBBqBZBBB qBB63D-iBq BBqBqzBfiqa BoBaqqBqBz 

081^ OZaBBBBBBB BBBuBBBboB qqqqDBBZaD BqBqaBqBBq BDOOBBqcqB qBZBqaB^BB 

Oc?' BBqoBfiDqoe BBBBqoBqBB qoBBBDHq-q aqBqzqaBBe zBqaBBqBqo qB^BBBaBBq 
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<210> 48 

<211> 257 

<212> DNA 

<213> Homo sapien 



<400> 48 

accagtatac gaaaatgcaa ataccacttg tgtactcaaa caaaagttgg tcttaagctc 60 

ccacc-cgag cagccttgga aacccaacct gcccctccta gcacaaccac acctcccaaa 120 

tgactttctt tgtccctgaa aaagtgattt gtattagtct tacacttgtc tttcggaaga 180 

ttata-ttcgt atacgtatca tcataaaata cctaaataaa aagtatctcc agagtgaaaa 240 

aaaaaaaaaa aaaaaaa 0C7 



<210> 49 

<211> 652 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feacure 

<222> (1) . . . (652) 

<223> n = A,T,C or G 



<400> 49 

accagtccag acgagcggcc gctgaagggg ccccctcgtc atttccatca taacccaatt 50 

cccacttatc cgaactczca agtcataaac gcacaacgac tcacgaacta gcacagc.taa 120 

gtrgacacca gaaactcccc atttccgcat tacaccacca aacaggaaac atcggaaaga 180 

cggggaaaaa aacccccttc taaaatggct cagaaagttz ccagatcact tcgaaaactc 240' 

taaacccctt tctgttccca aaacctgaaa aiacgtagat ggacccacgc accaagaccg 300 

ctcccaaagc tctcctcaca tcttcaaagt gcgacttccc ttttaacaca cacatttacr 360 

ttccttaaag cagccatacc ccaacccatg actctggaga cacacctatn aaaccaacac 420 

aacagcangg tcacngaagc agctctctca aacgttgcct cagatgtgca agttgcaaat: 480 

tttatcgcat ctgcanaata caattttcgt tttaaactgc atttcaatct acttccccaa 540 

gatgcttttc atacagagcg aaatatccca ngacaactgc ttccgcgtcg tcgcacctga 600 

cgcataactg cacaaacgaa cagtgtatac ccctcggtcg cgcaccnacc cc 6 52 



<210> 50 

<211> 650 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 
<222> (1) . . . (650) 
<223> n =s A,T,C or G 



<400> 50 

ttgcgctccg attttcttag ggcctgtgcc crgtttcact tacagggtc" agaatgcttg 60 

tgctgagtaa aaaggagatg cccaatatcc aaagctgcca aatgctctct tcgccataaa 120 

gaccccgcgc aaccgtgtga acactcggga tttccctcct ccgccccgag gccctcgtct 180 

gccctctttc ctgggttccc cctagaagat tgagaaacgc acacgacagg ctgagancac 240 

ctccccaaac acacaagctc tcagccacan gcagcttccc cacagcccca gcctcgcaca 300 

ggctcccgga nggccgcctg ggggaggcag acacgggagt gccaaggtgg ccagacggtr 360 

ccaggactac aatgccccca ccttcaactg ttcgccaccg ccgcccccac ccctgccccg 420 

ccccggagta zcgzczgccc canacaagcg ggancgaaac gggggtgggg gggaacactg 4 80 

acccccancc agggggcgcc caactgaaca gcagggacan aaggcgtgaa cccgngaant: 540 
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<221> misc_feature 
<222> (1)...(502) 
<223> n = A,T,C or G 

<400> 53 ^ 

tgaagaccct ggtgtcgcca tgggccgccg ccccgcccgt tgctaccggt actgtaagaa 60 

caagccgcac ccaaagtctc gcctctgccg aggcgccccc gatgccaaaa tccgcaccct 120 

tgacccgggg cggaaaaang caaaantgga cgagcctccg ctttgtggcc acatggtgtc 180 

agatcaatat gagcagccgt cctctgaagc cctgnangcc gcccgaactc gtgccaacaa 240 

gtacatggca aaaagtngtg gcnaagacgc tcccatatcc gggtgcggnt ccaccccttc 300 

cacgccatcc gcaccaacaa gatgttgccc tgtgctgggg ccgacaggcc cccaacaggc 360 

atgcgaagtg ccrctggaaa acccanggca ctgtggccag ggtccacatt gggccaaccn 420 

atcacgttca tccgcaccaa ccgcagaaca angaacncgc naatcnaagc cctgcccagg 480 

gncaantcca aatttcccgg cc 502 

<210> 54 

<211> 494 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (494) 

<223> n = A,T,C or G 



<400> 54 

actagcccaa gaaaaatacg cctaacgtac actacaaagg ctccgtacat gccaacccgt 60" 

cctaacgcca aaagtctgcc ttgcccacaa cctcctcaag acccctccag aaagggacrt 120 

gtttgcctca acgaacactg tcgggaaaaa acacagcaca acgagcgaaa agggcagaag 180 

caagaaattt ccacacccta gcgactccaa gaagaatgag tacccacacc tagacggcac 240 

atca-gagga ctrtaatctt cccttaaaca caataatgtt ttctttttcc ctttatccac 300 

atgatttcta agtacacctc tcacgcagga cagttctcca accttgacgt acagtgaccg 360 

tgtnaaattc tccttccagt ggcaacctct ataaccctca aaatatggcg agcatctcgc 420 

ctgtrttgaa ngggacacga cnatnaacct atcagatggg aaatcctgtc cccaagtrag 480 

aaaaaaaaaa aaaa 494 

<210> 55 

<211> 606 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (606) 
<223> n = A,T,C or G 

<400> 55 

actactaaaa agcagcaccg ccaaacaatc cctaacrtcr cactaaaaat ataacgaaac 60 

gacgtwaagc ttrtcgaaaa gctcaggtta aacctaccgt tgttagacca atgtattcgc 120 

tgctrccctc tacctggaat gtggcattag ctttctcacc ctaaccccct ccaaccctta 180 

tccaactcca cgacttaagg ctggagagct aaacaccggg atttttggat aacagaczga 240 

cagttctgca caactataac cggcactgca catagaaagg acacggccac ctttcgttaa 300 

atccgcaccn tccaaatacc aaaaaaggga aatgaagcac aaaccaatrt ctgcacaacc 360 

cgctigaaac atganctcta cttgctcaat atcanggctit cgcccrrtrc cgccagtczc 420 

ttgcgaccct gcgcaaaact gccctcatta aacaccaaac agccaagzcr atccccrggt 430 
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<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 

<222> (1) . . . (649) 

<223> n = A,T,C or G 



<400> 59 

acca^ccact atccgacttt cnggttacaa tcattctaat gagtgtgaag tagcctctgg 60 

cgccactcgg atttgcattt ctctgacgag tgacgctatc aagcaccctt gctggtgccg 120 

tcggccatat gcgcatgttc cccggagaag cgtctgcgct gagcctcggc ccacttttta 180 

attaggcgcn tgtcctttta ttactgagtt gtaaganttc ttcatatatu ctggacccca 240 

gaccctcacc agatacacgg tctgcaaaca tcctctccca ttctgtgggc cgtgttttca 300 
ctctaccgat aangtcctta gacatacaat aaatctgtat cttaaaagcg acttgatctg - 360 

ggctgtgcaa ggcgggctca cgcctgtaat cccagcactt cgggagactg aggtcggtgg 420 

atcatacgan gangctagga gttcgaggtc agcctggcca gcacagcgaa aacccgtctc 430 

nacnaaaaac acaaaaacca gccaggcatg gcggcgcacg tctgcaacac cagczncrca 54 0 

ggangccgan gcacaaggac cacttgaacc ccagaangaa gangctgcag cgancugaac 600 

accacgccag ggcaacaaaa acgagaacct gcttaaaaaa aaaaaaaaa " 64 9 

<210> GO 

<2il> 423 

<212> DNA 

<213> Homo sapien 



<220> 

<221> mis cofeature 

<222> (1; . . . (423) 

<223> n = A,T,C or G 

<400> 60 

actagcccag gccctccagc ccactgacaa acatggggaa gtgtgcccag ctggctggaa 60 

acccggcagc gataccacca agcctgacgt ccaaaagagc aaagaacacr cctccaagca 120 

gaagcgagcg ctgggctgct ctagtgccag gctgcggcgg gcagccatga gaacaaaacc 180 

ccttctgcat ttnttctttc cattagcana acacaagacc cngattcagc cgaatcgcgg 240 

cgtcccacaa ggcagggctt tcctacaggg ggtgganaaa acagcctttc ctcccttggi: 300 

aggaanggcc cgagttggcg ttgtgggcag gctactggct tgcacgacgc attagtagag 360 

caacccacta atcttttgta gtttgtatna aacttganct gagaccttaa acaaaaaaaa 420 



<210> 61 

<211> 423 

<212> DNA 

<213> Homo sapien 



<220> 

<22l> misc_feature 
<222> {1)...(423) 
<223> n = A,T,C or G 

1 

<400> 61 

cgggactgga acgcaaagtg aagtccggag ccctgagcac gggctcctcc cgccgggccc 60 
cccctcccca gacrccagag ggagaggccc accccgccca gccccgcccc agcccctgct 120 
caggcccgag cacggccggg agtcgggggc cacaggcccc cagccgtgcz gcccaagaac 130 
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<210> 64 

<211> 313 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feacure 
<222> (1) ... (313) 
<223> n = A,T,C or G 

<400> 64 

accagttgcg caaaccacga ctgaagaaag acgaaaagtg ggaaacaacc cgcaacgcct 

gtcagagatg gtcgccacac atgttgggtc tgcagagaaa catctcgagg agcagactgc 

taaagttgac agagaatatg aagaatgcac.gtcagaagac ctctcggaaa ataccaaaga 

gaccagagac aagcatgaga agaaagctac cctaactaag ccttctgaag aatgaagacn 

aaacgttgat catgtatata tacccacagt gaacaaaacc gcctcagcaa agctccaaaa 
aaaaaaaaaa aaa 

<210> 65 
<211> 420 
<212> DNA 
<213> Homo sapien 

c220> 

<221> misc_feature 

<222> (1) . . . (420) 

<223> n = A.T,C or G 

<400> 65 

actagtcccc tggcaggcaa gggcttccaa ccgaggcacz gcacgcgcgg cagagagagg 60 

caggaagctg gcagcggcag ctcccgtgtc tagggaggcg cgtggcrccc tccttccctg 120 

tccgggaggt tggagggaag aacctaggcc ttagcctgcc ctcccgccac ccttcccctr 180 

gcagatactg cctcaacact ccctcctctc ccagctgtgg ctgccaccca agccaggrct 24 0 

cnccgcgcnc accaacttat ttccaggaaa ggcgcgtgga agacacgagc cgtgtacaat 300- 

atrtgtttta acattttcat tgcaagcatt gaccatcacc cctggccgcg catcgttgta 360 

acacaaatta acgatattaa aaagcatcca aacaaagccn annnnnaana nnannngaaa 420 

<210> 66 

<211> 676 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 
<222> (1) . . . (676) 
<223> n = A,T,C or G 

<400> 66 

actagtcccc tacgaccatt aaactcattc tcagggctaa gaaaggaatg caaacttctg 60 

cctcaattcg taccccacca acaagttctc gaagagtgca gatttctagt caggtcttaa 120 

aaataaaccc acaaatctgg atgcatttct aaattctgca aatgtttcct ggggtgactt 180 

aacaaggaat aatcccacaa tatacctagc tacctaatac atggagctgg ggctcaaccc 24 0 

accgtttcta aggatttgcg cttacttgtg gctgaggaaa aataagtagc tccgacggaa 300 

gcagttttta aatgcgagct tatagatngg aaacagaata tcaacttaat tatggaaatt 360 

gttagaaacc tgctctcttg ttatccgaat cttgattgca attactattg tactggacag 420 
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<220> 

<221> misc^f eacure 
<222> (1) . . . (39S) 
<223> n = A,T,C or G 

<400> 69 

cagaaacgga aagcagagtt ttcatttctg cttataaacg tctccaaaca aaaatggaaa 60 

gcagagtccc cattaaatcc tttcaccttt cctttttccc ggtaatcccc tcaaataaca 120 

gtat^ftggga tattgaacgt caaagggata tttttttcta ttacctttat aactgcacaa 180 

aattaagcaa atgttaaaag tcttatacgc tttattaacg ttttcaaaag gcacnacaca 240 

cgtgacacat tttttaagct tcagttgctt gtcttctggt actttctgtt atgggccttt 300 

ggggagccan aaaccaatcc acnatctcct cttgtttgcc aggacatgca ataaaactca 360 

aaaaataaac aaaaaccatc nagaaattga aaaaaa 396 

<210> 70 

<211> 536 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feacure 

<222> (1) . . . (536) 

<223> n = A;T,C or G 

<400> 70 

accagcgcaa aagcaaatat aaacatcgaa aaggcgttcc ccacgccagc tgaagacacc 60- 

ctccgaaaga cccctgtaaa agagcccaac agtgaaaacg cagatatcag cagtggagga 120 

ggcgtgacag gctggaagag caaatgcrgc tgagcactcc cccgttccat cagtcgccat 180 

ccactacccc gttctctctt cr.tgctgcaa aataaaccac tctgcccatt tttaactcca 240 

aacagacatc tttgtctctc atcctaacca tccaagccac ctactttatt tgttctt-ica 300 

tctgtgactg cttgctgacc tcaccacaat cttcttcaaa caaaaaaatg tacagaaaaa 360 

tcacgcccgc gacctcattc ttaaatgnta cctgcccagc ccaactgcau ttcagtcgtt 420 

ctatagccca gctcctacca acattnaaac ctacngcaat canctcaaac ctattctgca 480 

aaccgcataa gaataaaagt tagaatttaa caatcaaaaa aaaaaaaaaa aaaaaa 536 

<210> 71 

<211> 865 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (865) 
<223> n = A,T,C or G 

<400> 71 

gacaaagcgt caggagaaga anagaggcag ggaanactnc ccaggcacga cggccncrtt 60 

cccaccagca accagcgccc cccaccagcc cccaggcccg gacgacgaag actccazccc 120 

ggactaatcc nacctccntc gcctgnccca ttcccacctc ggaggcggag gccggaaagg ISO 

ccncaccaag aganaanctg ccgccaacac caaccgcccc agccctggcg ggcacganag 240 

gaaactggtg accaacctgc agaatcccna gaggaanaag cnaggggccc cgcgcrnaga 300 

cagagccgga tatgangcca gaccatggac nccacncccn ncaatncana cgggacrgcg 360 

gaagacggan gacccncgac nngaccaggc cngcznncca nccccccacc cccatgaacc 4 20 

acccccgccg aangaatccc tgannggctt ccannaaagc gccrccccnc cnaacgnaan 430 
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<221> misc_f eacure 
<222> {11... (437) 
<223> n = A,T,C or G 

<400> 74 

accagttcag actgccacgc caaccccaga aaatacccca cacgccagaa aagtgaagtc 60 

ctaggcgtct ccatctatgc ttcaacccgc ccatctacca ggcctcgcga taaaaacaaa 120 

acaaaaaaac gctgccaggc tttanaagca gctctggtcc caaaaccatc aggaccctgc 180 

caccagggct cccrtgaaat agtaccacat gtaaaaggga atccggcttc cacttcatcc 240 

aatcXccgaa ttgccaggcc ttgattgaca actgtagaaa taagtagcct tccgttgtgg 300 

gaataagtta taaccagtac tcatctctcc gcttttcgtc actcctctc!: ccctnattgc 360 

gtcatttgca ctgctcgaaa aatatttcct ctataaaatc aaactaaccc gccctaaaaa 420 
aaaaaaaaaa aaaaaaa 

<210> 75 

<211> 579 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> {IJ . . . (579) 
<223> n = A,T,C or G 

<400> 75 

ctccgccgcc gccaagacga tgtgcggggc gccccccgcc acgcagccgg ccaccgccga 60 

gacccagcac atcgccgacc aggcgaggtc ccagcccgaa gagaaagaaa acaagaagct 120- 

ccccgtgttc aagcccgtgt catccaagag ccaggtggrc gcggggacaa actactccat 180 

caaggtgcac gccggcgacg aggaccccgc acacccgcga gtgtcccaac ccccccctca 240 

cgaaaacaag cccttgacct tatctaacca ccagaccaac aaagccaagc atgatgagcn 300^ 

gacctatttc tgaccctgac trcggacaag gcccctcagc cagaagaccg acaaagccat 360 

cctccgtcta ccagagcgtg cacctgcgat cccaaaataa gcttcacctc cgggctgtgc 420 

ccctggggtg gaaggggcan gatctgcact gctttcgcat ttctcttccc aaattccact 480 

gcgttgattc cctccttcca ataggtgatc ::tnaccact.t tcagaacatc tcccaaacna 540 

gacatatttt naaaatcctt aaaaaaaaaa aaaaaaaaa 579 

<210> 76 

<211> 666 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (i; . . . (666) 
<223> n = A,T,C or G 

<400> 76 

gtttacccta tcrctccaac cagattgcca gctccctgag ggcaagagcc acagtacatt 60 

tcccrgctcc tcccacagcg cctaataaca ctgcggaact aggttttaac aatttctcaa 120 

ccgacgttgt tacgggcagg acggcaacca gaccattgtc tcagagcagg cgctggctct 180 

ttcctggcta ctccacgntg gctagccccc ggtaacctct tactcattat cttcaggaca 240 

cccaccacag ggaccaggga cgacgcaaca cccccgtctc tttacgacag gacgttiigcr 300 

cagcttcccc aacaacaaaa agcacgcggc aaaacactcg cggacattct ggactgcrtt 360 

caaaaaatat acagcrracc gaaaatcaca rcatcttaca acgaaaagga ncccatagac 420 

cagccagcga acaacccttc cccaccatac aaaaacnccc ccccccgaan gaaaanagct 480 
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<22Q> 

<221> misc_f eature 
<222> (1) . . . (456) 
<223> n = A,T,C or G 



<400> 7S 

actagtatgg ggcgggaggc cccacccttc ccccccaggc gccgttcttg ctccaaaggg 60 

ccccgcggag agggactggc agagctgang ccacctgggg ctggggatcc cactcccctc 120 

gcagctgtcg agcgcaccta accactggtc atgcccccac ccctgctctc cgcacccgcr 180 

tcctcccgac cccangacca ggctacttct ccccccccct tgcctccctc ctgcccccgc 240 

tgcccccgat cgcangaatc gangantgtc ccgccctgtg gctganaatg gacagtggca 300 

ggggccggaa acgggcgcgc gcgtgtgtgn gtgtgtgtgc gtgcgtgtgt gcnccccccc 360 

cgcaagaccg agactgaggg aaancatgtc tgccgggtgc gaccatgttt cctccccaca 420 

aancncccct gtgacnctca naaaaaaaaa aaaaaa 456 



<210> 80 

<211> 284 

<212> DNA 

<2i3> Homo sapien 

<220> 

<221> mi3c_f eature 

<222> (1) . . . (284) 

<223> n = A.T,C or G 



<400> 80 

cctcg-acct ctagaaaaga caggtaccgt gtcacgaaac tcgagcttaa actccacaTia 60 

taaaaccaaa agtaatgctc actccagcaa cacataccaa aattggaacc acaccgagaa 120 

gaacagcatg acc-ccgcgc aaacaggaca agcaaattcg cgatgcgtcg actaaaaaga 180 

aacaaacaaa tgcgcatacg tgtaactngc atgttcatgt ggaatacaga ttgggaaaLa 240 

aaatgcattc cttactgcga aaaaaaaaaa aaaaaaaaaa aana 284 



<210> 31 

<211> 671 

<212> DNA 

<213> Homo sapien 

<:220> 

<221> misc_feature 
<222> (1) . . . (671) 
<223> n = A,T,C or G 



<400> 81 

gccaccaaca ttccaagcta ccctgggcac ccttgtgcag tagaagctag tgagcatgcg 60 

agcaagcggt gtgcacacgg agactcatcg tcataattca ccacctgcca agagcagaaa 120 

gaaaggctgg ggacatttgg gttggcttgg tcttgattct ttgcttgttt gtttgtrttg 180 

cacuaaaaca gtactatctt ttgaatatcg tagggacata agtacataca tgttatccaa 240 

tcaagacggc tagaatggcg ccttcctgag tgcctaaaac ttgacacccc tggcaaatcr 300 

ctcaacacac tcccactgcc tgcgcaatga agtcccgact catttttaac cactggaat:- 360 

ctccaacgcc gtcactccca gttagacnat ctcgcacttt gagattaaaa cgccatgccr 420 

atttgactag tctT:atcttt: ttatttttac aggctcatca gtctcactgt tggccgccac 480 

tgcgacaaag ccaaacaaac ccccnaggac aacacacagt atgggatcac ataccgttrg 540 

acacraagcc ccggccaaaa aatgttgcac gcgtttcacc tcgactcgct aaaccaatan 600 

canaaaggcc ggcznataac gtcggnggcg aaataatcaa tnantaacca aaaaaaaaar. 660 

aaaaaaaaaa a si I 
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<210> 85 

<211> 771 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 

<222> (1)...(771) 

<223> n = A,T,C or G 



<400> 85 

aaactgggta ctcaacaccg agcagatctg tccttcgagc taaaaaccac atgctgtacc 60 

aanagcttgc tcctggccgc Ctcgatgtca gtgctgctac tccacctccg cggcgaatca 120 

gaagcaagca accctgactg ccgccttgga -racacagacc gcacccrtca tcccaaaccc 180 

accgcgggct tcacacggca gccggccaat gaaggctgtg acatcaacgc tatcatcctc .240 

cacacaaaga aaaagttgtc tgtgtgcgca aatccaaaac agacccggat gaaacatatt 300 

gtgcgtctcc tcagtaaaaa agtcaagaac atgcaaaaac cgtggcctte ccggaacaaa 360 

actggacata gcccaagaac agaaagaact tgctggggrc ggaggcrtca cttgcacacc 420 

acgganggcc tagtgctcac cccatccgtg cctcctggac ctgcccaacc nacgaagcta 



<210> 96 

<211> 623 

<212> DNA 

<213> Horiio sapien 

<220> 

<22l> misc_feacure 
<222> (1) . . . (628) 
<223> n = A,T,C or G 



430 



accacactgc atcatantcc gcttcgttta acatcacatc naaattaaac tgtatctcat 540 



600 



gtcacccata gccncaggct ttctgcgctt aactttccat acnaancccc ccaaactact 

tcggcncant gcaanctaaa aaccatactt ggggggggaa caaacatcgg antttccgca 660 

gccacaagct cttcttaaaa aaccantaca nccnngttaa acggtnggtc ccnaatggr.t 720 

cttgcttttn antagaaaac ctntcagaac nattcgaaaa aaaaaaaaaa a 771 



<400> 86 

accagcttgc tttacatttc tgaaaagtat tatctttgcc caagtgctta tcaactaaac 60 

cttgtgttag gtaagaatgg aacttattaa gcgaaccagc gtgacccttc ttgccataag 120 

actatcttaa agctgaagcc aaaatatgct tcaaaagaaa angactttat tgttcattgt 180 

agttcataca ttcaaagcat ctgaactgta gtttctatag caagccaatt acatccacaa 240 

gtggagaang aaatagatta atgtcnaagt atgattggtg gagggagca^ ggttgaagat 300 

aatctggggt tgaaattttc tagttttcat cctgtacatt tttagttnga caccagactt 360 

gaaacattaa tgtttacctt tcaatgtgtg gcatcagccg gactcantaa cacccctttc 420 

ttccctnggg gacggggaat ggattattgg aaaatggaaa gaaaaaagta ctcaaagcct 480 

tcctcccnca gcttctggct cctaccctac cgacttancc agaataagaa aacaccttat 540 

caccntccgc tttatcccca tcaatnaant tctgatgaac aaatccgctt ttatgcnnac 600 

ccaaggaatc nagtggntcc ntcnttgt 628 

<210> 87 

<211> 518 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc feature 



SNSDOaO: <WO_9«7674A?JL> 



<l~<wfrZ9iW6 ^0M> :aiooasNi 



fNQ <ZTZ> 

zzs <xxz> 

68 <OXZ> 

^^81 eeee BBBeeeeBHe eeeeeeHeeB BaoBaeGBas BerjetiatieeB 

0081 aBBD65BOBB BB:ieB:iHqa5 uaB^BBtje^E eDBBBqeDOB BBaaEa£::B4 B00BB6:ia3:i 

OfLX BDBaBa:iB:i6 :jB5aaoB30B :jDaoBBBB2= r-UBBBB^BoB BB:jaa:3£::ao bbbbobobi^ 

0891 z^zzBo5:iDa B^B^BBBBaS :iBDDBaq:iB£ zubbbbb^uo b6:ib5Bb:i:;U o^BBoaBBBB 

0Z9X BBBBBo:j:i:iB BBaa:i:i:jBaB ubbb:j:ibbbB u:ibbb6b6bB BB^BUB^afiB BBBqBB:i6Bq 

095T B5oB:iBDDBB BB6:i6B:i6a2 BBBa:>63aB3 6:ioBaaqaBB BBBqBBBfiaB DOBqqaBBOO 

DOST DBBBBqqofiD oBqqODBBqq ^^BOBbBBD^ OO^BBBBD^q B^BuBBbbBO q^BBBB^aaB 

O^^t oqaOOBB^BO DBBOBB:jaBB BoqBBfiBBaB oBrjBqqoBqo qooB^BuaaB BqoBBDfiqBO 

08ei boBbBb^o^B BDBqoqoBDq DfiBBBoBBoq cqqoBaqqDo qBqoBaDBqo oqqoq36:iaB 

OZZZ OD-aBBBBBB BqDo:i6BBqB B^odBboBbo BqBqooqBfiB ^oBbbBbBBB BBBq^D^BOo 

OSZX ooaBqBBDqq ^dBBbbdbbd BqoBqoBqBB BBBqBfiBOBo oqqoBqBaOo BboBbdB^Bb 

0021 BfiBBBaoBBO BbBb^oB^BB bd^BbbBbdc b^obB^odob qBBqoo^aao oqooqoBqBB 

O^TT BaoDOBOOOD oaBqBBBqoo o^BBbBbbbB BBBBqDoqBB aBqBBBBBBB qoDBoqqqoB 

080T iBcaqBqBBB o^bb^Bbbbd B^bbBB^bBb oBqqqaqBan ^bbb^odbbb bd^^b^bdob 

0201 c223:iaBDqq oaqqDoaaqq B^BaqqaBBfi bob^bbbb^b qBqBaoBaao oqqeqaBDoq 

096 G25BDqB3qB naBBsBOBoq BBaoBBqica BqqBOBBaDB qqoBacoBoa coBaoooBoB 

006 bbbBbubood BqqaqoDOBO jqBQBBBBoq qBoqBBaoqq iBDqBBBoau oqoBBoBaBB 

0^8 qBsaBBaBOO BtioB^BoBaB oaqoBoaBqB aBfiBaqqBBB bboBb'-BBb aaBaBBaBBi 

OQL BBBBfiqqoBB bbubb^bBBb BaqaoBBBua qBOoBBOBap Bobbb^^bbb qqqaoBqqoo 

OZL BsBi-aqqqqB qcqooBBqfiB BaBqqBBfiBB BBBBBasqaB BBbbbbbo^B BBoqqqaBBa 

099 BBsBqBBqqB b^BbbbbBbd oqqBoBBoBB BqBqqBqBae qBBBqBqrqB BoBBBeqBOB 

009 OBBaBBfiBqo BBqBBOoBBB OBBBBoBBBq Bqa:;qqq3DB B^bobbbbob iBBBBBiBaq 

Ol^s liuBBBDfiqoq BqBBOBBqaB qBarisqBaBc BBasBqBaBq BBqaaBOB^B oBBqBaBOBa 

08^ BBczoBoaBq BBBqaqanae oqoBBaBBBB BqqqoqBBqq 5a6q3C-::BBB OBBaBBoaoq 

02^ BDBOBUDDaq oojoBbB^ob 0BquqaB3BB bbBbbbod^b :ioqB6B£BB6 3q:i33cqBBq 

09e azacaBqfiqq i-qBOBBUBBB aBa6Ba:za6B BBBsBaq^Bo BBfiBaBraBq bbbBb^bbbB 

OOe UuiobbBbBb BqBBDBoqoq qBqBOBOzzi BqB:iDqBOBq DBqqqBCcaq BqfiqeaqoBB 

O^Z BjBaBBBBBB joBBBoqoBq qqqoqBaBqB aBqBoofiBBB 6£qo6a££zq 3qqDqB0c:;q 

08T 3--aBB6Bq6 BoBqaBBiBs bBBBbbbbbB BqqoBaoBBB tia-HBoBBBB UDBqquBqBB 

OZX BBOBoaBBfiB BBBBBBBBBB BBBBBBBBBO q^qfic^aoBfi BB3qn3:.qBa zqqBqzqqBq 

09 Z-zqBqqsBq qBaqBBBBBB BBoqooBB-:; qqaBBBBBBo OBafiBZCcqB bBqBbdbBbd 

96 <00l'> 

uatdBS O'JUOK <ZXZ> 
VNG <ZXZ> 
^f8T <XXZ> 
88 <0I2> 

dddbBBBb bbbdBbbbbb BqaBocDcoo BbBoubbbb^ 
08fr £zOBBBqDqo BqqqqBoBao BsqqqBBBDB oBbbBboBbb qBOoBa&oqo ooBBqqaBBU 
02^ q£zqB6qB:iq BBBBBqBBBO qqoBBfiqquc coBqBBuqoo qBBBBBqaqo BqBBBBaqBB 

09e BEoBBofiBBq qqqfiSfiqqBB BqBBoBoqqq BoooqqBBBB oBboB^bbob BqqqBsoBqo 
OOe BccqqqBBqB DBBqBqoBqq BqqqooBBqB qBqqoqDqoq qqBBoqqqBB qqqBOBOBBB 

O^Z BBiqqBBBBq qqqqqqqqaB qqBBBBBqco qBoqBBBqqq qqBBoqBBBO BBqBDBqqqq 

C8X BOBqBfiBDfiq BBBBBBaBqfi qqqqfiBBOBB BBqqoBqBqq qqsBBBqqqq BBOBqBBaBB 

031 BD^BOBqBBq BqfiqBBBBDq qBBqBqBDBq BBqqqBBqBq qqBqoB^Bqq BDBBDBsqBq 

09 qHiaqqBqoo BqqaqqqBBB qBqqqaqqaq DBBoqqoBqB BBBBBqa-qq qqqBqqqqqq 

LQ <00t'> 

0 = u <ZZZ> 

(8TS) • • * {!) <ZZZ> 



Si 



S6LS0/66Sri/X3d 



PL9LPm OAV 



wo 99/47674 



PCT/US99/0S798 



<:213> Homo sapi n 
<220> 

<221> misc_featurs 
<222> (1) . . . (523) 
<223> n =s A,.T,C or G 

<400> 89 

cttttttttt tttttttagt caatccacat tcactgatca cttattatgc accaggcact 60 

gggataaaga tgaccgttag tcacccacag caaggaagaa aactagcaaa caagacgatt 120 

acaacatgac gcagaaaacg ctaagccaga gacatagaaa ggtcctattg ggtccccctg 180 

tcaccttgtc tttccacatc cctacccccc acaggccttc cctccagctt cccgcccccg 240 

ctccccactg cagatccccc gggattttgc ctagagctaa acgagganat gggccccctg 300 

gccccggcac gacttgaacc caaccacaga crgggaaagg gagccccccg anagtggacc 360 

acttcgatna gaaaacacat agggaatcga agagaaantc cccaaatggc cacccgtgct .420 

ggcgctcaag aaaagcctgc agaatggata aacgaaggat caagggaatt aatanatgaa 480 

taaccgaatg gtggcccaat aagaacgact ncnttgaang acc ' 523 

<210> 90 

<211> 604 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (604) 

<223> n = A,T,C or G ' 

<400> 90 

ccagtgtggc ggaacgcaaa gactaccccg gaagctttcg agaagctggg attccccgca 60 

gcaaaggaaa cagccaatac gcgtcgtttc racgaaanga agccagaccg agatgtcaat 120 

ctcacccacc aactaaatcc caaagtcaaa agcttcagcc agtccacccc agagaaccag 180 

gggagccctc aagggcacgt agaaaatcag ztgctcagac aggcctctgc accacacagc 240 

ctctttcctc tctgatcctc tccctcttta cggcacaaca tncatgtttg acagaacacg 300 

ccggaatgca attgtttgca acaccgaagg actccctgcg gtcgcccctt cagtaggaag 360 

cactgcattg gtgataggac acggcaattz gatccacatt caactcgcta gtcagcgata 420 

aggggtggta cacccgtttg gtaaaatgag aagcctcgga aacttgggag cttctctccc 480 

accactaatg gggagggcag attattaccg ggattcctcc cggggtgaac taatttcaag 540 

ccctaactgc tgaaattccc ctnggcaggc tccagttttc tcaactgcat tgcaaaatcc 600 



cccc 



604 



<210> 91 

<211> 858 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_f eature 
<222> (1) . . . (858) 
<223> n = A,T,C or G 

<400> 91 

tccctctctc ctttcttcta tgattattac trccrrcacc gatccccaca ccctcagcgt 60 

cggcagagtc tctgatgctc aacaaacacc tgtcccgatc agacaagtgg aaaaaatrgc 120 

catttcctca ticcaagccac gctttcctgr gacacrctga ccctagtcga acacacaaaa 180 



iNSOOOD: <WO__9947e74A2J_> 



OT'S eD3cz^:iouo ccdduudbdo L:DUD6a:;auu Buubbb^^ou DoocoeeuDo BPuaaBHu6u 

OSr uBH::-o6aa6 £556Eocoub euaa6uuuui£ cacouBBeBB cooBBBCDzn oouuooccos 

oet 3-aBBBDDou cr;ooaBu:5:i6 aaBBB6au62 £3cou:i::j6ua oubbb3o566 auoDUBB:ja5 

C9£ D6£au3uouD 6u2^BB::a66 BBB5a:jBBa3 ^003:1:16^:53 ^dddooudbb b66u66ouod 

OOZ 02o£ou6665 555uuuuuuu uuuuuuuuuu uuuuubb6:i:i ^d:i:j:563UB3 o6q:iBBB:j:iB 

OfZ BoqBBBBBUB uBaBBB556q :i3a:ia:io6::u qBuBaBeaoB DBaeBaBaB^ ao3:3aa6BDD 

08T dSbboodBdo £36a:i03BB6 6BUBa6a3:i6 3BDaao6:i:io odoo^ooboo oB:i6ac3o:ia 

OZT DBB663qD3B 65BB535qD6 ao6665630B b656bb65ub o65B365553 oBBoBaoBOB 

09 ^DO^DD^BSB 335BaoBB53 oBB^oubbBB ^aooBOO^B^ BoBao^B^oB q^BaBBoBBo 

£6 <00fr> 

0 D'X'V = u <£ZZ> 
(LSS) - • (I) <ZZZ> 
sjn^Baj osTui <xzz> 

<ozz> 

ustdBS ouiOH <ZXZ> 
VNQ <ZXZ> 
LSS <TZZ> 
tS <OXZ> 

ouBDa ^B^oobbbBB Bco ^uBoo^a a^oBoooB^a oBOOZoBoBa 
0^5 aBu£aaBB33 BOBo^oBuao BbB^BbB^^b BaooBaBBBB BaooBBBBBq B^Bubu^udB 
08t bbBBcodBbB 0B3BDBBDUO UOD33BBOBO ^oBBoD^Ba^ B^^BBBBaBq 6333333630 

02!^ BB3B03BB3e C3BB33E36B 3306060633 BB:;3665b63 6B33303333 63333o£boo 

09f 3o6b5636Bo BooeooBooe £o66Bbb6uu ubuububbbu uubu36D3bb 3BB3BBBBbb 

OOt BB3BB33B6b BB363b66b6 33BOBB3363 030BBB6B3a B05bB33B33 363BBBOBB6 

Q^Z B03B6303B3 30B3033333 3BBOB63BBB 66bB3BB633 366lBOBBBe B6BOeC3B3B 

081 b6B01bB6B3 3BB3336363 530333B333 B3B6B33335 306366Bfi33 BB6b6ob5B3 

QZX B3a£330366 130BB53BbO B03B5BBa-3 B-305bB306 e3333BOOC3 63B030B3Da 

09 OOB33B63D3 DBB6306033 033303333B 6b55BOB3B3 3b6b636630 0303BB6a36 

Z€ <00l'> 

O ao o'x'v = u <zzz> 

(SBS) •••(!) <ZZZ> 

SJn3BS5~OSTUl <XZZ> 
<OZZ> 

uaxdBS ouiOK <£IZ> 

VNQ <ZXZ> 

S8S <XXZ>- 

Z6 <OXZ> 

6666bb53 D00366BB66 



es8 








0^8 


666e33b633 


33BBBBB303 


UB6U3333B6 


08Z. 


033U666663 


3BB33U336B 


UU63003333 


OZL 


03333B033U 


0 3663353B3 


3OO636BO6B 


099 


636BBU0B60 


63BBBB3 3B3 


606636B363 


009 


633BBOBB3U 


33O03633B6 


30B363336B 


O^S 


03B0B0BB36 


3330633BB3 


30BB033B33 


08^ 


3336036336 


B3DBO33636 


0B0B33B360 


QZl^ 


6666c6b633 


33B603B3OB 


3B633B3306 


OSZ 


5636b 3 ob66 


33B3B33B03 


33BB63BB33 


ooe 


5bbb6bb636 


B0333B0B03 


6b365b5636 


0^2 


03B63533B3 


336bB30B66 


B3BB3B3036 



S6LS0/66SfyJDd 



wo 99/47674 



PCT/US99/05798 



38 

nttcggggaa aaccctnccc gcgccca 5g7 

<2X0> 94 

<211> 620 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (620) 
<223> n = A,T,C or G 

<400> 94 

actagtcaaa aatgctaaaa taatttggga gaaaacattt tctaagtagt gttacagtt" 60 

catgtttatc ttttactatg tttcgtgaag ttgtgtcttc ccactaatca cctataccac -120 

gccaatattt cctracatct atccataaca tttacactac atttgtaana naatatgcac 180 

gtgaaactta acaccttata aggtaaaaat gaggtttcca anatctaaca atctgatcaa 240 

gttcttgcca cctccaaata gaacggactt ggtctgctaa gggccaagga gaagagaaaa 300 

ataaggtcaa aagctgctaa tgaccaaaca ccctaaaaga aatgcaaaaa aaaagtccac 360 

tctcaagcct tcgaaccacc taaggaaagc aaaaccattc cctaaatgca caccatccgt 420 

gagaattcct cauraatatc ctgaatcact catttcacta aggctcatgc tnactccgat 480 

atgtCLCtaa gaaagtacta ttccatggtc caaacccggt tgccacantt gggcaaaagc 540 

ttccccttaa gtgtgaaanc actcaaaatg aaattctcct ctttctaaaa attctttana 600 

aggg--aagg gcgctgggga ^20 

<210> 95 

<:2ll> 470 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc^feature 
<222> (1) . . . (470) 
<223> n = A,T,C or G 

<400> 95 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactac 60 

nactctncgc ttaattcang agcttacang atccttcaaa gagtgngtcc agcatccttc 120 

gaaacatgag ttcctaccag cagaagcaga cctttacccc accacctcag ctccaacagc 180 

agcaggtgaa acaacccatc cagcctccac ctnaggaaac atttgttccc. acaaccaagg 24 0 

agccatgcca ctcaaaggtt ccacaacctg naaacacaaa nattccagag ccaggctgca 300 

ccaaggtccc tgagccaggg ctgtaccaan gtccctgagc caggttgcac caangtcccc 360 

gagccaggat gcaccaaggt ccctgancca ggttgcccaa ggtccctgag ccaggctaca 420 

ccaagggccc gngccaggca gcatcaangt ccctgaccaa ggcttatcaa 470 

<210> 96 

<211> 660 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature ' 

<222> (1) . . . (660) 

<223> n = A,T,C or G 



INSCXXaO: <VVO 9947674A2 I > 



QOS eeceeeoDaa Beoaaaouoa aeeeriaHono OH:4aeqao2:j oBeonaeBaB eBo56PHB6:i 

Oi-s eoBHa^aeBe eeeB^BB^oz oa:iee6Boac 6ao6BD3:ja:3 SenoueBeoe aupp:i:i0366 

081^ ee62D6Be5:i eeSaoaoaBe BaotioeoBOB :i 00^:1:10:1:1:1 oBeeBoPiiDB iiB^bbBbbbb 

OZ^ BBOoooo:i05 fi:ioa:iBoBB:j aeBoBauooE aa:iBDooa6e B3BaBa6ao:i :iBOODaBaoo 

09e ooaHB5:io6:i buhdobooBB ooBcBBBfiaa ooo6:ihoo6b OBOoa56:iB:i bb^BbbBboB 

00£ bboobubbBb oBboooBboo bdBbdo:iobo :i6b:jbbo:i:jo odB^oooBbB aooB:iBBBBo 

0^2 00DB00B3B0 ofiBBBOoaaB BbbdooBbod fiqDODBBB:io oB^Bbbbooo OBOoBaoooB 

081 BBaooBaBBB bodoobdoBz oooBbBBbbo obbbboooo^ boB^boobbB 6Bo:iooBooa 

OZX ooBBOoBaao oBbobbbB^B BboBboBboB Boa:io6Boqo doooboooqb o6:ioooBbdB 

09 bbBboBboBb ooo3:iBbB3b oBeoBn^Bao booBboobBB boobobd^i^o 3o:ioo33b:5B 

86 <00^> 

0 JO o'i'Y = u <ZZZ> 
(009) - * (X) <ZZZ> 
san^BSj osTui <zzz> 
<QZZ> 

uaxdBS ouiOH <zxz> 
YNQ <ZZZ> 
009 <11Z> 
86 <OZZ> 

'^^^ B BBBBBBBBBo ooooac:;fiao 

OZ^ :i3Bo6B:iaBB o6a:iooBqa3 BOOoaBBBaB qfia6ao3:iBo oo3B:iooooa bbB^oB^bBb 

09e 00B00B600B B6BBBa:»ooc B^booBbobo oaB6:iBiBB:i BbbBbdBbbo obubbBboBb 

ooe oooBbodboB BDoaoBoaBB aBsouuoooB ::ooo6BBaoo BoBBbboooo booB^booBb 
O^Z BBCoaqBBBB oooBBOofiac oo6bB:jooB2 Bbbboooobo o6:ioooBBB3 ooBiBBbboo 

09T 00B00B3000 BbBBbboobb BBOoocqBoB zboobbBBbo :;ooboo:idoB BooBaacoBB 

OZX dbbb6:iBBbo BboBboBbozj uoBBOziooori oBoooaBoB:; dooBboBbbB boBboBbooo 

09 aa6BB:iBo6B oBzjaBr^OBOo BboobBBboo bobd3:io:io3 coQtiBaSBUB ob^boobBBB 

^6 <00>> 

D JO o'x'v « u <ZZZ> 

dfrf') (T) <ZZZ> 

ajcn^Baa osxui <xzz> 
<OZZ> 

uaidBS OUIOH <£XZ'> 

VNQ <ZXZ> 

Tfr> <XXZ> 

LS <QXZ> 

DouBBB^^oo obBBBbuuoB 

3:ioobBBubo qoBBBaoouB 

aBB^tlooBBB 6:iBBq:iOBBB 

Bfiaa:50BBBB BDBOuBaooB 

B^ooBBB^oB B6ao6ao:iao 

:i:ioBBa3BBu BBuo^boBbo 

BZ036BBBDB aqBBaoBiBa 

:i^aaaB:iBoB a6BqB:;a3oB 

:iaBB:iaoB3o :i:i:idbBbb6:i 

qBBBoa^Bo:! qijaoa^^BOfi 

31:1:1:1^33:13 3333333333 
96 <00^> 



6e 



099 06B3300UBU OU3UB63330 oobB6Bbu33 0B6B3000BB 

009 B3033B0BUB 633OOOB6BB BBO3306B33 OOBBOUOUB3 

OfrS UBUBB3300B 6BBB030BB3 0B330B3BB3 30Ob6B6bbo 

08> BBBO30663O DB0BB3BB33 30Bb66B030 B33OO3B3B3 

02^ B3B3B3:i3oo B3BB3030BB 3BOBB6bbB3 3B00B63BBB 

09e UBB3 306303 30B33BBBOO BOB3B303B3 30666bB030 

ooe ouB3BBBb6b bBBB3bBbob bbBbobdbBb b33BB0bB6o 

O^Z bbBbBiBBBB 330BB3B30B BbBb6b6bb3 BBBOB3BBBb 

081 BBBeoBBB36 363B3BB6BO 33033b66B3 B330BBBBbo 

OZX booBbbbbBb oobBbbBooB B0BbB3000B BB3BBb0330 

09 3BOBBBBBbO BbBbdBBBbB 3BB333BBD6 BBBB33BBfi6 



86Z.S0/66Sfl/XDd 
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<210> 99 

<211> 667 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) , . . (667) 
<223> n = A,T,C or G 



<400> 99 

accagcgact gagtccctgg caaagaaatt cgacctggac cagttgataa ctcatgtccc 60 

accatttaaa aaaatcagcg aaggatttga gctgctcaat tcaggacaaa gcattcgaac 120 

ggtcctgacg ccctgagatc caaagtggca ggaggtccgt gttgtcatgg tgaactagag 180 

tttctcttgt gagagttccc tcatctgaaa tcacgcacct gcctcacaaa cacaagcata 240 

agtagaagat ttgctgaaga catagaaccc ctacaaagaa ttactaaccr ttataaacat 300 

ttaaagtctt gcgagcaccc gggaatcagt acaataacaa tgttnatatc tttgatttac 360 

atttcgcaag gctacaattg tatcttccaa gaaaacacac ctcgganctc tatgttgaaa 420 

tggagatttt caagagtcct aaccagcrgc cgcagataca ttacccaaaa cagacatagc 4SD 

gtacaaagac atagtaaacg cacctcccag agtaacattc acttaacaca tcggaaacca 540 

ctatttttca gattcgaata cnaatgccat ctttcaaaca ctcgttatga gctacctggg 600 

attacatttt gaaaccagtc catcccacga tgcanattac tgggattaga ttaagaaaga 660 



cggaaaa 



<2I0> 100 ' 
<211> 583 
<212> DNA 
<213> Homo sapien 



667 



<220> 

<22l> misc_feacure 
<222> (1) . . . (583) 
<223> n = A,T,C or G 



<400> 100 

gctttgtttg caagatgatc acagtcatgc cacactgatc taaaggacac acatataacc 60 

ctttaaaaaa aaaatcactg cctcattctt atttcaagat gaatttctac acagactaga 120 

cgttcctctg aagatcaatt agacatcttg aaaatgattt aaagtgtttt ccttaacgtt 180 

ctctgaaaac aagtttctct tgtagtttta accaaaaaag tgcccttttt gtcactggat 240 

cctcctagca ttcatgattt tcctttcata caatgaaatt aaaactgcca aaatcatgga 300 

ctggccttct ggttggattt caggtaagat gtgtttaagg ccagagcttt tctcagcatt 360 

tgattttttt ccccaatatt cgacctttta aaaatataca catnggtgcc gcatttatat 420 

ctgctggttt aaaattctgt catatctcac ttctagcctt ttagttatgg caaatcatat 480 

tttactctta cttaaagcac ttggcnactt ggancatctg gttctannct aaaaaaanta 540 

attctatnaa ttgaantttc ggtactcnnc catatttgga tec 533 

<210> 101 
<211> 592 
<212> DNA 
<213> Homo sapien 



<220> 

<22l> misc_feacure 
<222> (1) . . . (592) 
<223> n = A,T,C or G 
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<210> 104 
<211> 575 
<212> DMA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1)...(575) 
<223> n = A,T,C or G 

<400> 104 

gcacctgctc ccaatccnnc tctcaccatg accctccgcc cgcanaaacc cctccaccaa 60 

ctatggangt ggtctcnggg gtggctctcg ccaactggga agaagccgtg gtgtc^ccac 120 
ctgtceaacc cngtttgtgt ctgggggatc aactnggggc tacggaaacg gctnaaccgc ' 180 

tgccctggtg gaagggcrgg taattggcct tgggaagcng cttacngaaa ttggccrnog 240 

gaagtcgcta ttgaaagtng ccntggaagt ngncttggtg gggggctttg ccgqcgaccc 300 

ctgttnaatt cggg-gcttt gtnaatggcg gccccctcnc ccgggcaang aaaaaaacca 3 60 

ccnacgcngn aaaccccnac nnaacagcct gggctcccct cacctcgaaa aaagccccrc 420 

cccccccaaa aaaggncaan cccctcaann tggaangccg aaaaaacccc cgaacggoga 4 80 

ncccnaaaac aaaaancccc ccncttcccn gnaanggggg aaataccncc cccccaccca 540 

cnaaaaccct tntaaaaaac cccccgggaa aaaaa 575 

<210> 105 
<211> 615 
<212> DNA 
<213> Homo sapien 

<220> 

<22l> misc_f eacure 
<222> (1) . . . (619) 
<223> n = A,T,C or G 

<400> 105 

cactagcagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgaccaoa 60 

gcctaaccca ggttaaccgc aagaagaggc gggatacttc cagcttccca tgtaaccata 120 

tgcataaagc caatgtagtc cagtttctaa gaccatgtcc caagctaacc gaatcccacc 180 

tcaacacaca ctcatgaact cccgatggaa caataacagg cccaagcctg tggcatgacg 240 
cgcacacttg ccagactcan aaaaaatact actctcataa atgggcggga gcattttggt - 300 

gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat .tcattccacg 360 

gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 

tttccaaatt tttgcacagt cgctgcacac atctgaaatc atatattaag acccccaaaa 480 

aatgaagtcc ctggcttctc atggcaactt gatcagtaaa ggattcnccc ccgtttggca 540 

cctaaaacat ctactatacn gttnanacga aattccttct ccccncctcc cgaaaaaana 600 
aagtggtggg gaaaaaaaa 

<210> 106 
<211> 506 . 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (506) 

<223> n = A,T,C or G 
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<212> DNA 

<213> Homo sapien 

<400> 109 

acccgaggtc tcgctaaaat catcatggac ccacttggcg ccgtcagcac ccgacttggg 60 

ttcgatcttt ccaaagagcc gaagaaaaca aatgatggca acatcttctt tccccctgtg 120 

ggcatcctga ctgcaactgg catggtcctc ctggggaccc gaggagccac cgccccccag 180 

ttggaggagg tgtttcactc tgaaaaagag acgaagagct caagaataaa ggccgaagaa 240 

aaagaggtga ccgagaacac agaagcagta catcaacaat cccaaaagtc cttgaccgaa 300 

ataagcaaac ccaccaatga ttatgaactg aacataacca acaggctgtt tggaaaaaaa 360 

acatacccct tccttcaaaa acacttagat tatgttgaaa aatattacca tgcatccctg 420 

gaacctgttg attttgcaaa tgcagccgat gaaagtcgaa agaagattaa ttcctgggtt 480 

gaaagcaaaa caaacgaaaa aatcaaggac ttgttcccag acggctccat tagtagcccc 540 

accaagccgg tgctggtgaa cacggttcac ctcaaagggc aatgggacag ggagttcaag 600 

aaagaaaata ccaaggaaga gaaatttcgg atgaacaaga gcacaagtaa acctgtacag "660 

acgatgacac agagccaetc ctttagcctc actttcctgg aggactcgca ggccaaaatc 720 

ccagggatcc catacaaaaa caacgaccta agcatgcctg cgcttccgcc caacaacacc 780 

gatggcccgg agaagacaat agataaaaca agtcccgaga aactggtaoa gcgaaccaoc 840 

ccagggcata tggaagaaag aaaggcgaat ccgcacccgc cccggttcga ggcigaaaac 900 

agttacgatc cagaggcggt cccggctgcc acggggatgg gcgacgcccc cagcga^cac 960 

aaagccgact actcgggaac gtcgtcaggc tccgggctgc acgcccagaa gtccccgcac 1020 

agcccctccg tggcagcaac cgaggaaggc accgaggccg cagctgccac tggcacacgc 1080 

tctaccgcca catccgcccc aggtcacgaa aatgttcacc gcaaccatcc cttcccgctc 1140 

Ctcatcaggc acaacgaatc caacagcatc ctcttctccg gcaaaccttc tcccccccaa 1200 

gacgaccgtt gccacggcat tgccgctccc agcaaaaaac aaccaccagc gutacccata 1260 

cgactatgaa aaccgtccat tcctttaaac ggr.ggctcac ccgcattt 1308 

<210> 110 
<211> 391 
<212> PRT 
<213> Homo sapien 

<400> 110 

Met Asp Ser Leu Gly Ala Val Ser Thr Arg Leu Gly Phe Asp Leu Phe 

^5 10 15 

Lys Glu Leu Lys Lys Thr Asn Asp Gly Asn He Phe Phe Ser Pro Val 

20 25 30 

Gly He Leu Thr Ala He Gly Met Val Leu Leu Gly Thr Arg Gly Ala 

35 40 45 

Thr Ala Ser Gin Leu Glu Glu Val Phe His Ser Glu Lys Glu Thr Lys 

50 55 60 

Ser Ser Arg He Lys Ala Glu Glu Lys Glu Val He Glu Asn Thr Glu 
^5 70 75 80 

Ala Val His Gin Gin Phe Gin Lys Phe Leu Thr Glu He Ser Lys Leu 

85 90 95 

Thr Asn Asp Tyr Glu Leu Asn He Thr Asn Arg Leu Phe Gly Glu Lys 

100 " 105 110 

Thr Tyr Leu Phe Leu Gin Lys Tyr Leu Asp Tyr Val Glu Lvs Tyr Tyr 

115 120 125 

His Ala Ser Leu Glu Pro Val Asp Phe Val Asn Ala Ala Aso Glu Ser 

130 135 140 

Arg Lys Lys He Asn Ser Trp Val Glu Ser Lys Thr Asn Glu Lys He 
"5 150 155 

Lys Asp L u Phe Pro Asp Gly Ser He Ser Ser Ser Thr Lys Leu Val 
1^5 170 ' 175 
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acacccgccc caggtcatga aaacgttcac tgcaatcatc ccctcctgtt- cttcatcaaa 126O 

cacaatgaac ccaacagcat cctcttcttc ggcagatttt ctctccl^l agatgatcS lllo 

cgccacggca tcgccgcttt .agcaaaaaa caactaccag tgttactcat atga^tacS lllo 

aaaccgccca ttcttttaaa tggcggccca cccgcatcc gacuacga 1380 



<210> 112 
<211> 400 
<212> PRT 
<213> Homo sapien 

<400> 112 

Met ASP Ser Leu Gly Ala Val Ser Thr Arg Leu Gly Phe Asp Leu Phe 

10 15 
Lys Glu Leu Lys Lys Thr Asn Asp Gly Asn He Phe Phe Ser Pro Val 

'° 25 30 

Gly He Leu Thr Ala He Gly Met Val Leu Leu Gly Thr Arg Gly Ala 

35 40 45 

Thr Ala ser Gin Leu Glu Glu Val Phe His Ser Glu Lys Glu Thr Lys 
, ^ =5 60 

^er Ser Arg He Lys Ala Glu Glu Lys Glu Val Val Arg He Lys Ala 

Glu Gly Lys Glu He Glu Asn Thr Glu Ala Val His Gin Gin Phe G^n 

85 90 . 95 

Lys Phe Leu Thr Glu He Ser Lys Leu Thr Asn Asp Tyr Glu Leu Asn 

105 1-.0 

He Thr Asn Arg Leu Phe Gly Giu Lys Thr Tyr Leu Phe L^u Gin Lys 

115 120 ij": 

Tyr Leu Asp Tyr Val Glu Lys Tyr Tyr His Ala Ser Leu Glu Pro Va- 
130 135 ^^jj 

Asp Phe val Asn Ala Ala Asp Glu Ser Arg Lys Lys Ha Asn Ser Trp 

val Glu ser Lys Thr Asn Glu Lys He Lys Ifp Leu Phe Pro Asd lly 

1S= 170 175 

ser He Ser Ser Ser Thr Lys Leu Val Leu Val Asn Mec Val Tvr Phe 

180 185 190 ' 

Lys Gly Gin Trp Asp Arg Glu Phe Lys Lys Glu Asn Thr Lys Glu Glu 

195 200 205 

Lys Phe Trp Mec Asn Lys Ser Thr Ser Lys Ser Val Gin Met Met Thr 

215 220 
Gin Ser His Ser Phe Ser Phe Thr Phe Leu Glu Aso Leu Gin Ala Lys 
I. 235 '. 240 

He Leu Gly He Pro Tyr Lys Asn Asn Asp Leu Ser Met Phe Val Leu 

245 250 255 

Leu Pro Asn Asp He Asp Gly Leu Glu Lys He He Asp Lys He Ser 

265 270 
Pro Glu Lys Leu Val Glu Trp Thr Ser Pro Gly His Met Glu Glu Arg 

275 280 285 

Lys Val Asn Leu His Leu Pro Arg ?he Glu Val Glu Asp Ser iVr Aso 

290 295 300 

Leu Glu Ala Val Leu Ala Ala Met Gly Met Gly Aso Ala Phe Ser Gl'- 

310 315 
His Lys Ala Asp Tyr Ser Gly Mec Ser Ser Gly Ser Glv Leu Tyr Ala 

330 * 335 

Gin Lys Phe Leu His Ser Ser Phe Val Ala Val Thr Glu Glu Gly Th^ 
340 345 350 
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<210> 115 
<211> 506 
<212> DNA 
<213> Homo sapien 

<220> 

<22l> misc_f eature 
<222> (1) . . . (506) 
<223> n = A,T,C or G 

<400> 115 

cattggcnct ttcacccgct ntggaagcgc nnatctccaa cagtggacaa agcccccngt 60 

gcccnaaacc ccgtnacact: ttcgggaanc gaaaanttng cancacgaca ggctaccctg 120 

angcanagat gttccggata ccatcanatn cgcccccngt gtcagaggcr cataccgcgc iso 

cacgtaaacg gtacnccatt: cgccactatn anccaatcng aaatanggcc cccgggccar 240 

gaatancnng cagcncancr nanangctgc ctgtngcatc cattgcggcc aca^cacctc 300 

acancaccgt aaccccnacc nagtgagaca nactagnaan tccctagtga cggctcanga 360 

ctccaaacgg rxccacnccn aacgttcaaa agccanttaa gcgcaagaaa cacagactgg 420 

atgctccacc aactagtacc cgcaacgacn ggccugtccc aacacatctc cctitttccac 480 

gaccgcggta ncccgcaccg gaaaaa 506 

<210> 116 

<211> 3079 

<.212> DNA 

<213> Homo sapien 

<400> 116 

ggacccccgg gtttcccaaa ccccccacag agtcctgccc aggccaaaga gcaaggaaaa 60 

ggtcaaaggg cagaaaaaac gctgagttag gaggagctat ggaaggataa acccggcctc 120 

aaagaggcca aagcggctta cagggggcgc cgagggctcc ccacatcccc cggcctaaac 180 

cccgcaggca gatctgccca gcgggctctg ggacagctgt gccttcccta acaaaaaaat 240 

tgtgcacaaa aggacgaaac cctattttcc ctctagcaca taaccaagaa tataaggcca 300 

cagatcgcct ttcccagagg gaaaaccctg cagcaacctg ctgcctggaa aagcgtaaga 360 

gcagatcacc ggggaatcgt ttgccccccg ctgatggaca gcttccccaa gctccaaggg 420 
caggtgctca gcatgcaccg tactgggatg gttgtcaata ctcctggtcc cgtaagagtc . 480 

ccaggacact gccacgccaa tgccccctca gtccctggca tcctttctgg gctgcccaca 54 0 

gccccagcct ctatggtgaa gacatacctg ctagcagcgt caccaactcg ttgccaagag 600 

atcagcgctc gaaggcaagg ttatttctaa ctgagcagag cctgccagga agaaagcgrt 660 

tgcaccccac accaccgtgc aggtgtgacc ggcgagctca cagctgcccc ccaggcatgc 720 

ccagcccact taatcatcac agctcgacag ccctctcgcc cagcccagct ctggaaggga 780 

taaaaagggg catcaccgtr cctgggtaac agagccacct tctgcgtcct gctgagcccc 840 

gctcncccca gcacccccca acccactagt gcccggttct cttgctccac caggaacaag 900 

ccaccatgtc ccgccagtca agtgtgtccc ccggagcggg gggcagccgt agcttcagca 960 

ccgcctctgc catcaccccg tctgtctccc gcaccagctc cacctccgtg tcccggcccg 1020 

ggggtggcgg tggcggcggc ctcggcaggg tcagccttgc gggtgcctgc ggagcgggcg 1080 

gctatggcag ccggagcccc tacaacccgg ggggccccaa gaggacatcc accagcacca 1140 

gtggtggcag cttcaggaac cggctcggcg ctggtgctgg aggcggcrat ggcttcggag 1200 

gtggcgccgg tagtggatcc ggtctcggcg gcggagctgg cggcggcrtw gggctcggrg 1260 

gcggagccgg ccccggaggc ggctccggtg gccccggccc ccccgcctgc ccccccggag 1320 

gtacccaaga ggccactgcc aaccagagcc rccrgacccc cctcaacccg caaaccgacc 1380 

ccagcaccca gagggcgagg accgaggagc gcgagcagat caagaccrcc aacaataag- 1440 
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caggtcacgg gacccagagg gcatcggcaa atcactgaag taccacaaag acactcacca 1260 

ccctccagac gatcggatcc agcaggttga aactacccag agaaagattc aggaaaacca 1320 

gcctgaaaac agtaaaaccc tagccacaca gccgaatcaa cagaagacgc tggcatccga 1380 

aatagaaatg aaacagagca aaacggacga gcgccaaaaa tatgcagaac agtactcagc 1440 

cacagngaag gaccacgaac tacaaacaat gacctaccgg gccatggtag atccacaaca isoo 

aaaacctcca gtgaaacgcc gaagaatgca gagttcagca gatctcacca ttcaagagct 1560 

catggaccta aggactcgat atactgcccn ggccactctc atgacacaat acactaaact 1620 

cgccggtgac tcattgaaga ggctggaaga ggaggagacc aaaaggtgta aggagacttc 1680 

cgaacatggg gcacactcag atccgcttra gcgccagaag gcaacagtac ttgagaataa 1740 

caaacttaca ggaaagacaa gtgagtcgga aagaatggta gccgaactaa agaaacaaaa 1800 

gtcccgagta gaggaagaac tcccgaaggc cagggaggct gcagaaaatg aatcgagaaa i860 

gcagcagaga aatgtagaag atacctccct gcagaagaca agggctgaaa gcaaagccaa 1920 

gcagtaccgc agggaactcg aaaccaccgt gagagagaag gaagccgctg aaaoagaact 1980 

ggagcgggcg aggcagccca ccatagaggc cgaggctaaa agagctgccg ccgaagagaa 204 0 

cctccug^at tttcgcaacc agctggagga aaacacctcc accagacgaa cactggaaga 2-100 

ccacctraaa agaaaagact taagtctcaa tgattcggag caacaaaaaa acaaaccaat 2150 

ggaagaacca agaagaaaga gagacaacga ggaagaactc tcgaagccaa caaagcagac 2220 

ggaaaaagac cctgcacctc agaaacacgc agcagagaaa cagtcgaaaa aaaaacagaa 2230 

aatzgaactg gaagcaagaa gaaaaacaac tgaaat-cag cacacacgca gagaaaatcc 2340 

actgccagcg tgcccgatca cacaggctac atcatgcagg gcagtaacog gcccccagca 2400 

agaacacgac aagcagaaag cagaagaact caaacagcag gcagatgaac taacagctgc 2460 

caacagaaag gccgaacaag acacgagaga gctgacacac gaacutaacg ccccccagcc 2=20 

cgaaaaaacg tcatctgagg aaaaggcrcg tttgccaaaa gacaaactag atgaaacaaa 2530 

taacacaccc agatgcccta agtcggagct ggaaaggaag gaccaggcgg agaaagggca 2640 

ttctcaacaa ctcagagagc ttggtaggca aticgaatcaa accacaggca aagctgaaga 27Q0 

agccacgcaa gaagcragtg acctcaagaa aataaagcgc aatcaccagr caaaaccaaa 2760 

acczccraac catgaaaaag ggaaactaca aagagaagca. gacagaacca caagcgcaca 2820 

cgcrgeagc- gagaagaaca ttcagcacit: aaattcacaa acccacccuC cccgagatga 2830 

gaaagaacca gaaagac-ac aaar.ctgcca gagaaaazca gaccarctiaa aagaacaa;:- 2940 

tgagaaaagc catgagcagt tgcttcaaaa taccaaagcc gaaaaagaaa acaacgacaa 3000 

aacccaaagg ctcaacgaag aattggagaa aagcaacgag rgcgcagaga cgccaaaaca 3060 

aaaagcagag gagcttacta ggcagaataa tgaaaccaaa ttaataatcc agagaatcca 3120 

ggcagaatca gagaatacag ttccagagaa acaaactacc cagcaaagac gtgaagcaci: 3180 

gaaaattcag gcagatggct ctaaagacca gccacgcagc acaaatgaac acttgcacaa 3240 

acagacaaaa acagagcagg actctcaaag aaaaattaaa tgcctagaag aagacccggc 3300 

gaaaagtcaa aatttggtaa gtgaattcaa gcaaaagtgc gaccaacaga acattatcat 3360 

ccagaacacc aagaaagaag ttagaaaccn gaatgcggaa cngaacgct- ccaaagaaga 3420 

gaagcgacgc ggggagcaga aagttcagct acaacaagcc caggtgcaag agctaaataa 3480 

caggtcgaaa aaagcacaag acgaatcaca cttaaagacc acagaggagc agatgaccca .3540 

cagaaagatg gttctgtttc aggaagaatc tggtaaactc aaacaaccag cagaggagtt 3600 

tcggaagaag atggaaaaat taacggagtc caaagtcatc actgaaaatg ai^atttcagg 3660 

catcaggcct gactttgtgt ctcttcaaca agaaaaccct agagcccaaa aaaatactaa 3720 

gccttgcgaa acaaacatta aagaacttga aagacagctt caacagtacc gtgaacaaat 3780 

gcagcaaggg cagcacatgg aagcaaatca ttaccaaaaa tgccagaaac ttgaggatga 3840 

gctgacagcc cagaagcgcg aggctgaaaa cctgaagcaa aaaacggacc aacagatcaa 3900 

agagcacgaa caccaactag tcttgcccca gtgtgaaac- caaaaaaaga gcacagccaa 3960 

agaccgcacc tccaaaccag attctgagac gacagtgaag gagcgccagc actccggaga 4020 

gcrgtcccct agaaacactg gacaccctca cccaacaccc agatcccczr tgtcaagatg 4080 

gacccaagaa ccacagccat cggaagagaa gcggcagcac cgggccgccg aacagacacc 4140 

caaagaagtc caatcccagc caccaggggc cccactcgag aaagagaaaa gccagcagtg 4200 

ccaccccgag tacccucctc agacaagcac cgagtcacag acaacttttg atgagacaaa 4260 

ccccaccaca agaccgcccg aaaccgagaa gataagagac caagccccaa acaactctaa 4320 

accacccgtc aggcaccaag acaacgca-g cgaaacggaa crggcgaacg tcctgacacc 4380 

ccragagata gccaagaaca agcac-acga tacgcataca aaacccacaa cactaaaaca 4440 

agaaaagaac ccagrrccca gcgctgaaga acggacgctr caagggc::ca gagcacccaa 4500 
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ggtagagctg gggacgaatg cttagtgccc ccacttcttc tccccctccc tacaccatce 480 

gagcacccat cgctcaccat cagatcaacc tccgatctta catcacgatg taatcaccac 540 

tggagcttca ccgctactaa accattaacc tctcgcctcc agtgctccac ctcrgaggci: €00 

gagcattaca agaaaatgac ctccgccccc tttcactgca gaaaactgcc aggggcccac 660 

ctcagaacaa ctcccactta ctctccaccg gctcccaaac cccccaacct acaagtgccg 720 

tgaaccccca cccaggcagt atccacgaaa gcacaagtga ctagccctat gatgtacaaa 730 

gcccgcatcc ctgcgatgan ttctgcgccc ctcactgctt gcaantgcca aacaaagcag 340 

atutacaata catacattct tttacctcgc cccgctttgg ggccaaagcc ctgggctcaa 900 

accttcctat ctgacaagtg aatagctgtt ttcaaaagat aaccca 946 

<210> 115 
<211> 8943 
<212> DNA 
<213> Hcmo sapien 

<400> 119 

ccaacagccc ctgczccctg ggccccccca tgccatgccg caatccctcc cacccgacca 60 

acaccaacac ccagczccga cgcagccccr ctccgccccr. gccgcccccc gagccacag- 120 

tctccccccg cccc-gcccc cggcccgccg ccgcccccgc gctcgcagcg gccrcggcac 130 

ggcccaggta gcgagcagcg acctcgcgag ccccccgcac ccccgcccgg utccccggcc 240 

gtccgcctac cccrggcccc ctccgccctc cccgcgccgg cccgczzcgc ::tacgcc*cg 5 00 

gcgccgagcc gcccrcccga tcgcccgccg acacgagccg caacggaggc tcccacccgc 360 

ggatcaacac tctgggccgc atgatccgcg ccgagcccgg cccggacctg cgc-acgacg 420 

tgaccagcgg cggcgggggc accagcagga tgcactattc ccggcgcggc gcgatcaccg 480 

accagaactc ggacggctac tgtcaaaccg gcacgacgcc caggcaccag aaccagaacd 54 0 

ccacccagga gccg--gcag aaccgccccg acrgcctgat gcgagcagag crcatcgcgc 6C0 

agcctgaatc gaagracgga gatggaatac aactgactcg gagtcgagaa rrggatgagc 660 

gccccgccca ggccaacgac caaacggaaa tcctcgacag cctgaccaga gagatgcggc 720 

agacgggcca gccctgcgac gcttaccaga aaaggctrcc ccagcnccaa gagcaaatgc 7S0 

gagccctcta taaagccacc agcgcccccc gagcccgcag ggccagcccc aagcgtggcg 840 

gaggccacac ttgccagagt ggctccggcc gggatgagtt caccaaacac gccaccagcg 900 

aatgtccggg gcggacgagg cagcaaaggg cggagatgga catggnggcc cggggtgcgg 960 

acccggcccc agtggagcag cacaccaaca gccaccgggg cacccacaac cccaccggcg 1020 

actaccgccg gcagczggac aaaatcaaag ccgacctgcg cgagaaatcr gcgacctacc 1080 

agtcggagga ggagtacgaa aacctgccga aagcgcccrr ngagaggacg gaccaccrgc 1140 

gacagccgca gaacaccatt caggccacgc ccagggagac catgcggacc aacgaccgcg 1200 

aggaggagga gctgccgtac gaccggagcg acaagaacac caacatcgcc cagaaacagg 1260 

aggccttctc catacgcatg agtcaaccgg aagttaaaga aaaagagccc aataagctga 1320 

aacaagaaag tgaccaactt gtcctcaatc agcatccagc ttcagacaaa actgaggcct .1380 

acatggacac tccgcagacg cagtggagtt ggatccctca gatcaccaag cgcactgacg 1440 

tccatctgaa agaaaacgct gcctaccttc agttttttga agaggcgcag tctaccgaag 1500 

catacctgaa ggggccccag gactccatca ggaagaagta cccctgcgac aagaacacgc 1560 

ccctgcagca cctgccggaa cagatcaagg agccggagaa agaacgagag aaaatcctcg 1620 

aatacaagcg tcaggcgcag aactcggcaa acaagtctaa gaagactgta cagccgaagc 1680 

ctcgcaaccc agactacaga agcaacaaac ccactactct cagagccccc tgtgactaca 1740 

aacaagatca gaaaaccgtg cataaggggg acgagcgcaT: cctgaaggac aacaacgagc 1300 

gcagcaagtg gcacgcgacg ggcccgggag gcgttgacac gcncgttccc cccgcggggc 1860 

tgaccatccc tcctccgaac ccactggccg tggacrtcrc ttgcaagatt gagcagtac* 1920 

acgaagccat cttggccctg tggaaccagc cccacaccaa cacgaagagc ccggcgccct 1980 

ggcactaccg catgaccgac atagagaaga ccagggccat gacaatcgcr aagccgaaaa 2040 

caacgcggca ggaagattac acgaagacga cagccgacct cgagctacac taccaagagc 2100 

tcaccagaaa cagccaaggc ccagagacgc ccggagatga tgacaagcgg aaaacacagz 2160 

cccagcccac cgacgcccag aagcacracc agaccccggc catccagccr cccggccacc 2220 

cccagcacca gacagcgacc acaacrgaaa ccactcacca tggaacctg" zaagacg-ca 2230 

accacaacaa agcaaccgaa accaacagag aaaatgacaa gcaagaaaca rggacgctga 234 0 
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ttaacgatct acagagagag agggaaaatt tgagacagga aattgagaaa tcccaaaagc 5700 

aggctctaga ggcacctaat aggatccagg aatcaaagaa tcagtgtact caggcggcac 5760 

aggaaagaga gagcctcctg gtgaaaacca aagccctgga gcaagacaag gcaaggcrgc 5820 

agaggcrgga ggacgagctg aatcgcgcaa aaccaactcc agaggcagaa accagggtga 5880 

aacagcgcct ggagcgtgag aaacagcaaa ntcagaacga cctgaatcag tggaagaccc 5940 

aatat-cccg caaggaggag gctattacga agatagaatc ggaaagagaa aagagtgaga 6000 

gagagaagaa cagtcttagg agtgagaccg aaagactcca agcagagacc aagagaatcg 6060 

aagagaggtg caggcgcaag ctggaggaet ccaccaggga gacacagcca cagttagaaa 6120 

cagaacgctc ccgacatcag agggagacrg ataaacccag acagcgccca catgggcccc 6180 

atcgagagac ccagactgag tgtgagcgga ccgccgacac ccccaagccg gtgttcgacg 6240 

ggccgaggaa gaaggtgaca gcaacgcagc cctatgagcg tcagctgacc gacaaaacaa 6300 

cctcggacaa accattgaag gggaagaagt cagtggaaga agttgcttct gaaatccagc 6360 

cattccttcg gggtgcagga tctatcgctg gagcacctgc ttctcccaag gaaaaatact 6420 

cttcggcaga ggccaagaga aagaaat-aa tcagcccaga atccacagrc acgcttccgg 6480 

aggcccaggc agccacaggc ggtataac-g atccccatcg gaatgagaag ccgaccgtrg 6540 

acagtgccat agcccgggac ctcattgacr ccgacgaccg tcagcagaca cacgcagcag 6600 

aaaaagctat caccggtttc gacgatccac trccaggcaa gacagtacct gccccagaag 6660 

ccaticaagaa aaatttgacc gatagagaaa ccggaacgcg cc^gctggaa gcccagaccg 6720 

cctcaggggg tgcagtagac cccgcgaaca gcgcccttrc gccaaaagan gccgccttgg 6760 

cccgggggct gaccgacaga gatrtgtarc gacccctgaa cgatccccga gacagtcaca 634 0 

aaaacrccgt ggacccagcc accaaaaaga aggtcagcta cgtgcagccg aaggaacgci 6 900 

gcagaaccga accacacact ggcctgcrcc cgctcccagt acagaagaga agcatgccct 6950 

tccaaggaac cagacaacct gtgaccgcca c::gag::cagc agactccggc acattgagac 7020 

cgtccactgc caatgaaccg gaatccggtc agate tctra cgacgaggct ggtgagagaa 7080 

ttaacgactt cccccagggt tcaagctgca cagcaggcat acacaaugag accacaaaac 7140 

agaagcctgg canctacgag gccatgaaaa ttggcccagc ccgacccggc accgctccgg 7200 

agctgctgga agcccaagca gctactgcct C-atacLgaa ccczgr.cagc aactrgaggc 7260 

caccactgga ggaagcccac aagagacgrc t:ggtgggcac cgagtccaaa gagaagcrcr 7320 

cgccrgcaga acgagccgcc actgggcaca acgacrccga aacaggaaac atcacc-ctr 7380 

tgttccaagc catgaataag gaactcaccg aaaagggcca cggtatncgc ctaccagaag 744 0 

cacagaccgc aaccgggggg atcatcgacc caaaggagag ccatcgttca ccagccgaca 7500 

tagcacataa gaggggctat ttcaatgagg aactcagtga gatcccctca gatccaagtg 7560 

acgacaccaa aggacttttc gaccccaaca ccgaagaaaa tcctacctat ccgcaacnaa 7620 

aagaaagatg caccaaggac gaggaaacag ggccctgtct uccgcctcng aaagaaaaga 7680 

agaaacaggc gcagacatca caaaagaaca ccctcaggaa gcgtagagtg gtcatagccg 774 0 

acccagaaac caacaaagaa atgtctgtcc aggaggccta caagaagggc ccaatcgatw 7800 

atgaaacctc caaagaactg tgtgagcagg aacgtgaatg ggaagaaata accatcacgg 7860 

gatcagatgg ccccaccagg gtggtcccgg cagatagaaa gacaggcagt cagcacgata 7920 

ttcaagatgc cactgacaag ggccctgtzg acaggaagrt cctcgatcag taccgacccg .7980 

gcagcctcag ccccactcaa tttgctgaca tgaccccctt gaaaaatggc gtcggcacca 8040 

gcagcagcat gggcagtggt gtcagcgatg acgtttttag cagcccccga qacgaatcag 8100 

taagcaagat tcccaccata tccagcgcca ggaattcaac cacaaggagc agctcctttt 8160 

cagacaccct ggaagaatcg agccccatcg cagccatctt tgacacagaa aacctggaga 8220 

aaatccccat tacagaaggc acagagcggg gcaccgctga cagcatcacg ggccagaggc 8280 

ttccggaggc tcaggcccgc acaggtggca ccacccaccc aaccacgggc cagaagctgt 8340 

cactccagga cgcagtcccc cagggtgcga ttgaccaaga catggccacc agcgtgaagc 8400 

ctgctcagaa agcctccaca ggctccgagg gcgcgaaggg aaagaagaag acgtcagcag 8460 

cagaggcagt gaaagaaaaa cggctcccgt atgaggctgg ccagcgcccc ccggagctcc 8520 

agcacctcac gggaggtctr gccgacccgg aagcgcacgg gaggacaagc accgaagaag 8580 

ccatccggaa ggggcccaca gatggccgcg ccgcacagag gctgcaagac accagcagcc 8640 

acgccaaaat cctgacctgc cccaaaacca aaccaaaaac accctacaag gacgccacaa 8700 

accgctccat ggtagaagac accactgggc cgcgccctct ggaagccgcr rccgcgccgr 8760 

ccaagggctt acccagcccc tacaacacgc cctcggctcc ggggrcccgc tccggczzcz 8820 

gcicgggatc ccgccccgga tczcgczzzg ggccccgcag -ggg^cccgg agaggaagct 8880 

ccgacgccac agggaacccr ccccactrtc acrcctarrc acrragcagc agcccra^zg 8940 
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tcgtgagcga ctccaaaggc agcaatgaac ttcaccaagt tccatcgaac cgcgactgcc 180 

taaatggagg aacatgtgtg cccaacaagc actcctccaa cattcactgg tgcaactgcc 240 

caaagaaatc cggagggcag cactgtgaaa tagataagtc aaaaacctgc cacgagggga 300 

atggccactt ttaccgagga aaggccagca ccgacaccac gggccggccc tgcctgccct 360 

ggaactctgc cactgcccct cagcaaacgT: accatgccca cagatctgat gctctccagc 420 

tgggcctggg gaaacacaat tactgcagga acccagacaa ccggaggcga ccctggtgct 480 

atgtgcaggc gggcctaaag ccgcttgtcc aagagcgcat ggtgcatgac tgcgcagacg 540 

gaaaaaagcc ctcctctccc ccagaagaac taaaacttca gtgcggccaa aagactccga 600 

ggccccgctt taagattact gggggagaac ccaccaccac cgagaaccag ccctggtttg 660 

cggccaccta caggaggcac cgggggggct ctgtcaccta cgtgtgtgga ggcagcctca 720 

tcagcccctg ctgggcgatc agcgccacac actgcttcat tgattaccca aagaaggagg 780 

accacatcgt ctacctgggc cgcccaaggc ctaactccaa cacgcaaggg gagacgaagt 840 

tcgaggtgga aaacctcacc ctacacaagg actacagcgc tgacacgcct gctcaccaca 900 

acgacattgc cctgctgaag acccgttcca aggagggcag gtgcgcgcag ccatcccgga 960 

ctatacagac catctgcctg ccctcgacgc ataacgaccc ccagtttggc acaagctgcg -1020 

agaccactgg ctcccgaaaa gagaactcta ccgactaccc ctatccggag cagcrgaaga 1080 

tgactgctgt gaagccgant ccccacccgg agcgncagca gccccaccac tacggccccg 1140 

aagtcaccac caaaacgctg tgtgccgcrg acccacagcg gaaaacagac tcctgccagg 1200 

gagactcagg gggacccccc gtctgtcccc cccaaggccg catgactctg actggaaccg 1260 

cgagctgggg ccgtggatgc gccctgaagg acaagccagg cgcctacacg agagcctcac 1320 

actccctacc ctggacccgc agtcacacca aggaagagaa tggcctggcc ctctgagggt 1380 

ccccagggag gaaacgggca ccacccgctc ccttgccggc cgccacccrt gcagcagagc 1440 

catctccacc agctgtaaga agagaccggg aagat 1475 

<210> 123 
<211> 22S4 
<212> DNA 
<213> Komo sapien 

<400> 123 

cagcgccggc tcgcgccccc ctgccgcagc caccgagccg ccgtctagcg ccccgacctc 60 

gccaccatga gagccccgcc ggcgcgcctg ctcctctgcg tcctggtcgc gagcgactcc 120 

aaaggcagca atgaacccca ccaagttcca tcgaaccgtg actgtccaaa cggaggaaca 180 

tgtgtgccca acaagcactt ctccaacacc cactggcgca actgcccaaa gaaacccgga 240 

gggcagcact gtgaaataga taagtcaaaa acccgctatg aggggaatgg tcacctccac 300 

cgaggaaagg ccagcaccga caccacgggc cggccctgcc tgccctggaa ctctgccacr 360 

gtcccccagc aaacgtacca tgcccacaga cctgatgctc ttcagctggg cccggggaaa 420 

cataatcact gcaggaaccc agacaaccgg aggcgaccct ggtgctatgc gcaggtgggc 480 
ctaaagccgc ttgtccaaga gcgcatggcg catgactgcg cagatggaaa aaagccctcc . 540 

cctcctccag aagaattaaa atttcagcgt ggccaaaaga ctctgaggcc ccgctttaag 600 

atcattgggg gagaatccac caccaccgag aaccagccct ggtttgcggc 'catctacagg 660 

aggcaccggg ggggctctgc cacctacgtg cgtggaggca gcctcatcag ccctcgctgg 720 

gtgatcagcg ccacacactg cttcattgac tacccaaaga aggaggacta catcgtctac 780 

ccgggtcgct caaggcttaa ctccaacacg caaggggaga tgaagcctga ggtggaaaac 840 

ctaatcctac acaaggacta cagcgctgac acgcctgccc accacaacga cattgccccg 900 

ctgaagatcc gttccaagga gggcaggtgt gcgcagccat cccggactac acagaccacc 960 

tgcctgccct cgatgtataa cgatccccag tttggcacaa gctgtgagac caccggcttt 1020 

ggaaaagaga atcctaccga ctatctctac ccggagcagc tgaaaatgac tgttgtgaag 1080 

ctgatttccc accgggagcg tcagcagccc cactactacg gctctgaagt caccaccaaa 1140 

atgctgcgtg ctgccgaccc acagtggaaa acagatccct gccagggaga ctcaggggga 1200 

cccctcgtct gccccctcca aggccgcatg accttgactg gaattgtgag ctggggccgt 1260 

ggatgcgccc tgaaggacaa gccaggcgtc cacacgagag tctcacaccr cttaccctgg 1320 

acccgcagtc acaccaagca agagaatggc ccggccctct gagggtcccc agggaggaaa 1380 

cgggcaccac ccgctttc-r gctggttgct atcttgcagt agagtcacct ccaccagctg 1440 

taagaagagc tgggaataea ggctctgcac agatggatcc gcctgcgcca ccaccaggcc 1500 
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agcgcaggtt tcggacacca gagaaagtca ctcgcttgta ccattgccac ctcagaaagc 420 
cctgatgtga actcaaactt tacccctgtt acccaaagcc aacaactcca aggcagtagc 480 
ttcact 486 

<:210> 126 
<211> 3552 
<212> DNA 
c213> Homo sapien 



<400> 126 

cggcaggcag gtctcgtccc ggcacccccc cggcgcccgc gttctcctgg ccctgcccgg 60 

caccccgatg gccgccgctg ggccccggcg c::ccgcgcgc ggagccgccc gcctgcatcc 120 

gctgctgacc cccgtgatct tcagccgcgc cggtgaagcc tgcaaaaagg tgatacttaa 180 

tgcacctccc aaaccagagg cagacaaaac aaccggcaga gccaatccgg aagagtgccc 240 

caggtctgca gaccccatcc ggtcaagtga -ccrgacttc agagccctaa acgacgggrc -300 

agcgtacaca gccagggctg ttgcgctgtc cgataagaaa agaccactna ccatatggcu 360 

tcccgacaaa aggaaacaga cacagaaaga ggtcaccgtg ctgctagaac atcagaagaa 4 20 

ggcaccgaag acaagacaca ctagagaaac rgtrcccacg cgcgccaaga ggagacgggc 4 30 

acccaccccz tgcrccacgc aagagaattc -rcgggcrcr tccccac-gi: tcccccaaca 540 

agccgaacct gatgcagcac agaactacac cgccccccac ccaacaagcg gacgcggagc 600 

tgacaaagaa cctccaaacc cgttccatac agaaagagac accggaaatc cacttcgcac 560 

ccggcccgcg gaccgtgaag aacacgacgc rzccgacctg attgc-cacg cgtcaaccgc 720 

agacggacat: ccagcagatc tgcccccccc acracccatc agggcacagg atgaaaacga 780 

caaccaccct gtctccacag aagcaactta caacctcgaa gttntggaaa gr,agcagacc 840 

tggtactaca gcgggggcgg tttgcgccac agacagagat gaaccggaca caatgcacac 900 

gcgcc-gaaa cacagcattt cgcagcagac accaaggcca cccgcgctct Jittctgcgca 960 

ccccagcaca ggcgcaacca ccacagtctc tcaccacceg gacagagagg tcgtagacaa 1020 

gcacccattg acaatgaaag cacaagacat ggatggccag cctttcggac tgacaggcac 1080 

atcaacttgc accacaacag caacagatcc aaacgacaac gcacccactt tcagacaaaa 1140 

cgcttatgaa gcatctgtag aggaaaatgc acticaacgtg gaaatcci:ac gaacacccac 1200 

agaagacaag gatctaatta acaccgccaa rrggagagcc aatcccacca tctcaaaggg 1260 

aaatgaaaat ggacacttca aaatcagcac agacaaagaa actaatgaag gtgtcccccc 1320 

tgctgtaaag ccaccgaacc atgaagaaaa ccgccaagcg aacctggaaa ttggagcaaa 1380 

caatgaagcg ccatccgcta gagatatccc cagagtgaca gcctcgaaca gagcctcggc 1440 

tacagtccac gtgagggacc tggacgaggg gccrgaatgc actcctgcag cccaacacgc 1500 

gcggactaaa gaaaacctag cagtggggcc aaagatcaac ggctataagg catatgaccc 1560 

cgaaaataga aatggcaatg gtttaaggta caaaaaactg catgatccta aaggttggac 1620 

caccactgac gaaactccag ggccaatcac aactcccaaa atcctggata gggaggccga 1680 
aacccccaaa aatgagttgt ataatattac agccctggca atagacaaag atgatagacr . 1740 

atgtactgga acacttgctg tgaacattga agacgtaaat gataatccac cagaaacact 1800 

tcaagaatat gtagccattt gcaaaccaaa aacggggtat accgacattt; tagctgttga 1860 

tcctgatgaa cctgtccatg gagctccatt tcacctcagc ccgcccaaca cttctccaga 1920 

aatcagtaga ctgtggagcc tcaccaaagt taatgataca gccgcccgtc ttccatacca 1980 

gaaaaatgct ggattccaag aatataccat ::cctattact gtaaaagaca gggccggcca 2040 

agctgcaaca aaattatcga gagttaatct gcgtgaacgc actcatccaa ctcagtgccg 2100 

tgcgacttca aggagcacag gagtaatact cggaaaacgg gcaacccttg caatactact 2160 

gggtacagca ccgctctttt ctgtattgct aacctcagta cgtggagctt ttggtgcaac 2220 

taaagggaaa cgtcctcctg aagatttagc acagcaaaac rcaattacat caaacacaga 2280 

agcacctgga gacgacagag cgtgctctgc caatggaccc acgacccaaa ctaccaacaa 2340 

ctccagccaa ggtttttgcg gtactacggg atcaggaacg aaaaatggag ggcaggaaac 2400 

cactgaaatg acgaaaggag gaaaccagac cccggaaccc tgccgggggg ctgggcacca 2460 

rcacacccrg gacccctgca ggggaggaca cacggaggcg gacaaccgca gatacaccca 2520 

crcggagtgg cacagcccca ctcaaccccg rcccggcgaa aaaccgcacc gatgcaacca 2580 

gaangaagac cgcacgccac cccaagacca rgcccccacr tacaacrazg agggaacagg 2640 

ar-rccagcr ggt::crgtgg gcrgccgcag cgaaaagcag gaagaagatg gccczgaczc 2700 
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<400> 125 

agcgcgatgg atacztgcag aattcgggcc aagcgtggtc gcggcccgag gtccggaact 60 

tcccagcacy tgaaaaggag ccccctgagc cgactcggcr aaagccccac ctccgccccc 120 

cctcacttcc gccractgac ttccccggag catccatctg aacattaccg cctgctgcgt 130 

aacccggtac acacacagca tgactccctg gaatagagcg ggccggggtg ctcacgccgg 240 

gagagtgatc gacacgcact ttcaagctac acctaccatt tgcagcaaag gagaaaaaac 300 

accccgagta aattrcatca cttttcataa catcagcacc tgctccacca tcaaggagcc 360 

tcagcgcaac aggaccccca gtc^ctggct: caactgtggc agtgacagtg gcatcaagaa 420 

tgggataaaa tccc-gtttc acattggcat aaatcatcac aggatgagga aaatggaggc 480 

tgccrcttcc cacaaaggct tccacagcgg ccgggggcac agacccgccc gggcggcrgc 540 

tcgaaa 546 

<210> 13C 
<211> =155 
<212> DNA 
<213> Hcr;0 sapien 

<4G0> 13: 

accaaccgag gcgcrgggca gcgacccccg cagcggagac agagactgag cggcccggca 60 

ccgccacgcc tgcgczccgg ctgggctgcz gcccctgcct gccgcucctc ctgcccgcag 120 

cccgggccac ctccaggagg gaagtctgcg accgcaacgg gaagcccagg cagcgtacc- 180 

ccgaccggga act-::acaga caaaccggca acggactccg ccgccccaac cgcaacgaca 240 

acactgacgg cattraccgc gagaagcgca agaacggctt ccaccggcac agagaaaggg 300 

accgcrgn::!: gccczgcaac tgcaacccca aaggtcctct cagcg':tcga tgtgacaact 360 

ccggacggcg cagr-gcaaa ccaggtgcga caggagccag atgcgaccga cgtccgccag 420 

gctcccacat gccracggar. gcggggcgca cccaac^acca gagac-gcta gactccaagz 480 

gcgacrgcga cccagccggc atcgcagggc cccgtgacgc gggccqccgc gtcrgcaacc 54 0 

cagc::gccac cggagaacgc tgcgacaggc gccgaccagg tnsctataac "cggacgggg 600 

ggaaccrccga gggczgcacc cagcgettcc gccacgggca ccragccagc cgccgcagc" 650 

ccgcagaaca cagcg-ccat aagatcaccc ctaccttcca ccaagatgcc gacggcrgga 720 

aggcngtcca acgaaatggg ccccctgcaa agctccaatg gccacagcgc caccaagang 780 

cgtctagctc agcccaacga ctagaccccg cccatcccgt ggctcccgcc aaar.tccrtg 840 

ggaaccaaca ggtgagccac ggtcaaagcc cgcccttcga ctaccgtgtg gacagaggag 900 

gcagacaccc acc-grccac gacgcganrc tggaaggtgc cggLctacgg accacagctc 960 

ccttgacgcc actcggcaag acactgccut gcgggctcac caagacttac acattcaggt 1020 

taaatgagca cccaagcaac aattggagcc cccagccgag ccactccgag caccgaaggc 1080 

tactgcggaa ccccacagcc ccccgcaccc gagctacata cggagaatac agcactgggc 1140 

acatcgacaa cgtgaccctg acttcagccc gccctgtctc tggagcccca gcaccctggg 1200 

tcgaacagcg tata::gtcct gttgggtaca aggggcaatt ctgccaggac tgtgcttccg .1260 

gctacaagag agacrcagcg agactggggc cttttggcac ctgtattcct tgtaaccgtc 1320 

aagggggagg ggcczgtgat ccagacacag gagattgtca ttcaggggat <3agaacccug 1380 

acattgagtg tgccgactgc ccaattggtc tctacaacga tccgcacgac ccccgcagcc 1440 

gcaagccatg tcccrgtcat aacgggttca gctgctcagc gacgccggag acggaggagg 1500 

cggcgtgcaa taaccgccct cccggggcca ccggtgcccg ctgcgagccc tgtgccgatg 1560 

gctactttgg ggaccccttc ggtgaacacg gcccagtgag gccttgtcag ccctgtcaat 1620 

gcaacaacaa tgcggacccc agcgcctctg ggaattgtga ccggctgaca ggcaggcgre 1680 

cgaagcgcat ccacaacaca gccggcaccc actgcgacca gtgcaaagca ggctacccrg 1740 

gggacccatr ggcrrccaac ccagcagaca agtgtcgagc trgcaactgc aaccccacgg 1800 

gctcagagcc cgtaggacgc cgaagcgatg gcacctgcgt tcgcaagcca ggacttggig 1860 

gccccaaccg tgagcacgga gcattcagct: gtccagcctg ccacaatcaa gcgaagactc 1920 

agacggacca gctcacgcag cagcttcaca gaacggaggc cccgatctca aaggcccagg 1980 

gcggtgacgg agcagcacct gacacagagc cggaaggcag gacgcagcag gccgagcagg 2040 

cccrtcagga catircgaga gatgcccaga ccccagaagg cgccagcaga tcrctrggrz 2100 

cccagtrggc caagg-gagg agccaagaga acagccacca gagccgcccg gacgacriza 2160 

agatgactgc ggaaagagtt cgggctcrgg gaagccagta ccagaaccga gcrcgggara 2220 
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<400> 131 



aggtccggag ggcccacagc cggatgcggg acaccgggaa 
ttctgcaccc cggttgcagc gtgctgcaga cgaagtcctc 
cctgggcagc caycacgagg atcatgaccc ggaaaacaaa 
ccccgatgct ggtggagtgt ttgttgacac ccccgacgaa 
ccattgcgcc ggttcacccc tgagtcctgt tcccaacgac 
agaacgaggg caagacccct ccgcgagggt ttcagacccc 
4:gcccagaag ccaatgggtg cacagcgatg acacgaacgt 
aggacgccgc ctggaatacc caaattgaac cacagatgca 
gaactuccct tccgccacac agaaattgct cagccagatc 
cc-gaccctt cctgctcccc aggaagggag gtcagccccg 
cg-.gacaccg gagacaggtc ttcttcaccg acaggaagcg 
ttaaccgcta t 



aaagtggtca cagcacacat 
ccgctcgcca ccccacaccc 
gacgactgtg atccacacct 
agtgtgcagc gtcccccaat 
tgccagtgtt tcagacccaa 
cccctcccac cccactggag 
caatcttcgc tcggccagcg 
cgaagagggc gtacaagcua 
tcctgtactt cttttccrcc 
tctgcaaaac acaggacgcc 
ccctctggtg cctgcacgct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
671 



.<210> 132 

<211> 590 

<212> DNA 

<213> Homo sapien 



<400> 132 



ctgaacggaa 


aagcctatgg 


ctccgtgatg 


atactagtga 


ccagcggaga 


tgataag 




60 


cczrggcaatc 


gcccacccac 


tgcgctcagc 


agcggctcaa 


caatccactc 


cattgcc 


czg 


120 


ggtccatctg 


cagccccaaa 


tccggaggaa 


tcaccacgtc 


uracaggagg 


tttaaag 




130 


ctngccccag 


atatatcaaa 


ctccaatagc 


acgattgacg 


ctcccagrag 






240 


ggaactggag 


acacccccca 


gcaacatatc 


cagcctgaaa 


gtacaggcga 


aaatg-c 


aaa 


300 




aatcgaaaaa 


cacagtgacc 


gtggataata 


crgtgggcaa 


cgacacz 


atg 


350 


utrcragctia 


cgcggcaggc 


cagcggtcct 


cccgagatta 


"atcactcga 


r::ccga- 


gga 


420 


cgaaaacact 


acacaaataa 


ccctaccacc 


aacccaactt 


cccggacagc 


cagccri 


-gg 


480 


accccaggaa 


cagccaagcc 


tgggcaccgg 


acctacaccc 


cgaacaacac 


ccatcar 




540 


ccgcaagccc 


tgaaagtgac 


agtgacctct 


cgcgccccca 


actcagacct 






590 



<210> 133 
<211> 581 
<212> DNA 
<213> Homo sapien 



<400> 133 

aggtcctgtc cgggggcact gagaaccccc cccggaattc ccggggggtg ttggggagag 60 

actgtgggcc tggagataaa acttgtctcc tctaccacca ccctgtaccc tagcccgcac 120 

ccgtrctcat ctctgcaaag ttcagctccc ctccccaggt ctctgtgcac. cctgtcccgg 180 

acgczccggg gagcccatgg gtggaggagt ctccaccaga gggaggctca 'ggggaccggt 240 

tgggccaggg acgaatattt gagggacaaa aaccgtgtaa gagccaaaga attggcagca 300 

gggggagaac agagaggagc cgggctatgg gaaatgacct gaataacgga gccgggaaca 360 

tggctggata tctggcacta aaaaagggtc ttcaagaacc cacttcctaa tctctrcccc 420 

aatccaaacc atagctgtcc gtccagtgct ctcttcccgc ccccagctct gccccaggcc 480 

cczccragac tctgtccctg ggctagggca ggggaggagg gagagcaggg ctgggggaga 540 

ggccgaggag agcgcgacat gtggggagag gaccagaccc c 581 

<210> 134 
<211> 4797 
<212> DNA 
<213> Homo sapien 

<220> 

<22l> misc feature 
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ccattctgac ccggtttcLC caaactgaag cagcctctcc ccaggtccag ctctggaggg 3120 

gagggggacc cgaccgcttt ggacctaaat ggcctcatgc ggctggaaga tcctgcgggt 3180 

ggggcccggg gctcacacac ctgtagcact cactggcagg accaagcatc tcgggggggt 3240 

ggccgctgag cggcagggga caggagccac tctgtttcgt ggggaggcct aacctagata 3300 

tcgacucgtc ttcgcacacg cctcctccag ttccttgtcc acagcccagc agacctcgcc 3360 

acttctgagg caagttaagc aagttgatcc ggcacccccc catcttgctc ccctaaccca 3420 

tggtcgggag acagcatcag ggttaagaag accttttttt ctctttttaa actaggagaa 3480 

ccaaacccgg aagccaaaac gcaggctcag tttgtgtgtt gtctctcgag tttgtcgccc 3540 

atgtgcgcaa cagggtatgg actatctgtc cggtggcccc gtcctggcgg tccgccggca 3600 

ggctggccag cccaggccgc cgtggggccg ccgccccttt caagcagccg tgcccgcgtc 3660 

catgcgccca gggccatgct gaggcccggg ccgctgccac gttggagaag cccgtgtgag 3720 

aagtcaatgc cgggactcag ccttcagaca gagaggactg cagggagggc ggcaggggcc 3780 

tggagaccct cctgcaggcc cacgcccgcc cccctgtggc gccgtctcca ggggctgctt 3840 

cctcctggaa accgacgagg ggtgtctcgg gcagagctgg ccctgagcgc c"ccacccaa 3900 

ggccaggttc tccgttagct cctgcggccc caccctgggc cctgggctgg aatcaggaac 3960 

acttcccaaa gagcgatagt cttctgcrtt cggcaaaact ccacttaatr caatgggtt- 4020 

ttccccgtac agcagatctt ccaaatgcaa caaaccccaa tataaagcag tccgcgaacg 4080 

ccacrgcctc cgctcctcgc ctccgtgcrg tgcgcgacgt gaccggactc ccctgcaaac 4140 

accaacatgc cgggaaactc ggcccgaanc rccgcgcccc cgcctccccc acggggaggg 4200 

actctggttc cagggtcccc ctgtgcactt gcctttttgi: :;ctggctgaa actcucctgg 4260 

aggtcggcag gcccagccaa ggtttitacaa ggctgatgtc aatctctgtg tcgccaacc* 4 320 

ccaagcccat cccctaaatg gcaaaggaag gcggatggcc ccagcacagc ccgacczgag 4380 

gccgcggtca cagcggaggc gtggagccga ggcctacccc r.cagacacrc tggacaczcc 4440 

ccccccaccc ggctgcagag gccaganncc agcccagggt cctgcactca ctcgcttatc 4500 

tgacaacgtt tcagcgaccc cgctggccac tccgagagcg ggccagccrg tggatcagag 4560 

acgcaccacc aagccaaggg aacctgcgcc cggcatccga caccgcgacr ttctgcc::gg 4523 

agcgtacgac tgcacatgac tcggggg-gg ggaaaggggt cggctgacca zgczcazczg 4 630 

ctggcrcgcg ggacggcncc caagccagag gtaggtccac ctgcgcaacg acaacsaacg 4740 

gtacrcgtca tmcgggcaa cggccgc::gt ggtggtggct gagtctccc:: ttggcrr 4797 

<210> 135 
<211> 2856 
<212> DNA 
<213> Komo sapisn 

<400> 135 

tagtcgcggg tccccgagtg agcacgccag ggagcaggag accaaacgac ggggg^cgga 60 

gtcagagtcg cagcgggagt ccccggaccg gagcacgagc ctgagcggga gagcgccgcn 120 
cgcacgcccg tcgccacccg cgtacccggc gcagccagag ccaccagcgc agcgccgcca . 180 

tggagcccag cagcaagaag ctgacgggcc gccccatgct ggctgtggga ggagcagcgc 24 0 

ttggctccct gcagtttggc tacaacactg gagtcatcaa tgccccccag' aaggcgaccg 300 

aggagttcta caaccagaca tgggtccacc gctatgggga gagcaccctg cccaccacgc 360 

tcaccacgct ctggtccctc ccagtggcca tcttttctgt tgggggcatg attggccccc 420 

tcccrgtggg ccttcccgcc aaccgctctg gccggcggaa ttcaatgctg atgacgaacc 480 

tgctggcctt cgcgtccgcc gtgctcacgg gcttctcgaa actgggcaag ccctcrgaga 540 

tgctgatccc gggccgcttc atcaccggtg tgcactgcgg cctgaccaca ggctccgtgc 600 

ccacgcatgt gggcgaagtg tcacccacag cctttcgtgg ggccccgggc accctgcacc 660 

agctgggcat cgccgtcggc atcctcaecg cccaggcgtt cggcctggac cccatcacgg 720 

gcaacaagga cctgcggccc ctgctgccga gcatcatcct catcccggcc ccgctgcagr 780 

gcaccgtgcc gccctcccgc cccgagagcc cccgccccct gctcatcaac cgcaacgagg 840 

agaaccgggc caagagtgtg ctaaagaagc cgcgcgggac agctgacgcg acccacgacc 900 

cgcaggagac gaaggaagag agtcggcaga tgacgcggga gaagaaggrc accaccctgg 960 

agctgttccg cccccccgcc taccgccagc ccatccccac cgccgtggcg ctgcagccgc 1020 

cccagcagcc gcccggcacc aacgctgrct cccaccaccc cacgagcarc cccgagaagg 1080 

cgggggtgca gcagcccgcg tacgccacca ccggccccgg "atcgtcaac acggcctcca 1140 
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gtcactggct gcccccggaa cagggcgctg ctccatggct ctgctcgtgg tagtctgtgg 
ctatgtctcc cagcaaggac agaaacccag aaaaatcaat ccccctatcc tcactcttgt 
ccctcttccc aaagacatcg gcgaggtaat tcgtgcccct ttcacctcgg cccgcgacca 
cgctaaggcc aaanctccag acanayggcc gggccggcnc nataggggan cccaactcgg 
ggacccaaac tctggcgcgg aaacacangg gcataagctt gnttcctgtg gggaaa 



120 
180 
240 
300 
356 



<210> 138 
<211> 353 
<212> DNA 
<213> Homo sapien 

<400> 138 

aggcccagtc ctccacttgg cctgatgaga gtggggagcg gcaagggacg tttcccccgc 60 

aacagacact tagatttctc tcttgtggga agaaaccacc tgtccaccca ccgactcccc 120 

tacattgatg tggaaaccgc cgctgccacc accacctcct gaagaggctt ccccgatgcc • 180 

aatgccagcc accctggcac cctggccctc gagcaggctg cggtaagcag cgaccccccg 240 

ccccagccgt gcctctatgc caagcagcat crtgcactcc tggtcctgag cccccatctc 300 

gcatcggagc tcactcagac cccgsccgsg rassmcgctatn gccgaattcc age 353 

<210> 139 
<211> 371 
<212> DNA 
<213> Hcmo sapien 

<400> 13S 

agcgrggtcg cggccgaggc ccacccgaag caagajtgca gatggcagtg cgaacagaga 60 

agacacattc tacaccccaa agctttggcg caattcccat cgaccagagc cggcccgacr 120 

agcctcggaa aggccaccga aaaatctcca actiggatwat gttgacctct accctacrca 180 

ttttccagtg cctgcaaagc caggcgagga agngaiccca aaagacgaaa atggaaaaac 240 

actatttgac acagcggacc tctgcgccac gcgggaggcc gcggagaagc gtaaagacgc 300 

aggaccggac ccgcccgggc ggccgcrcga aagccgaatc ccagcacacc ggcggccgct 360 

actagtggac c 371 

<210> 140 
<211> 370 
<212> DNA 
<213> Homo sapien 

<400> 140 

tagcgtggtc gcggccgagg tccatctccc tttgggaact agggggctgc tggtgggaaa 60 

tgggagccag ggcagatgtc gcattccttt gtgcccctgc aaatgtggga 'ctacaagaag 120 

aggagctgcc tgagcggtac tttctcctcc tggtaatcct ccggcccagc ctcatggcag 180 

aatagaggta cttctaggct atttttgtaa catggcctct ggtcaaaatc cccgtgtagc 240 

cgaattccca agccctgcat tgtacagccc cccactcccc tcaccaccca acaaaggaat 300 

agttaacact caaaaaaaaa aaaaaacccg cccgggcggc cgcccgaaag ccgaacccca 360 

gcacactggc 370 

<210> 141 

<211> 371 

<212> DNA 

<213> Hcmo sapien 

<400> 141 

tagcgtggtc gcggccgagg tcctctgtgc igcccgtcac agcccgatgg taccagcgca 60 
955tgtaggc agtgcaggag ccctcatcca gcggcaggga acaggggtca ccaccatccc 120 
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cggaatccgg ggtgccctta taggaccaga ggttgtgttt gctccaccct ctcgaccccc 300 

atgtgagacc tcggccgcga ccacgctaag ccgaactcca gcacaccggc ggcccgttac 360 

cagcggaccc g 371 

<210> 146 
<211> 355 
<212> DNA 
<213> Homo sapien 

<400> 146 

ggtcccccgt ccccrtccca gaggtgtcgg ggctcggccc cagcctccat cttcgtctcc 60 

caggacggcg agtagcagcg gctccaaggc cgaattcatt gtcggaggga aatacaaact 120 

ggtacggaag accgggcctg gctccctcgg ggacatccat ttggcgacca acaccaccaa 180 

cggcgaggaa gcggcagcga agctagaacc ecagaaggcc aggcatcccc agctgctgca 240 

cgagagcaag ctctataaga ttcttcaagg tggggttggc accccccaca cacggcggca -300 

tggtcaggaa aaagaccaca atgtactagt catggatctt ccgggaccca gcctc 355 

<210> 147 

<211> 355 

<212> DNA 

<213> Heme sapien 

<400> 147 

ggcccgztac aaaacgaaga cagacaacac aacatttacc ctgcggagat atcccaccca 6C 

taccatgcac gtgcigtgat tncgaacata acccgtccca aaaaccrccc acgatcaccc 120 

tgactrrcca ggcrggctga rccaccaatc ccgcacc-caa ccgctactt:: ctccccagtg 180 

tcgccacgag caaagctigac ccgaacagca accaacggcc gcagacac^c aacatgcagt 240 

cc-ticcccau aacacgggaa acatc-naag cctatcatrc caciatgagg ataaacrrgct 300 

acatccggta tacrrtcact ctctgaaaca caatccaccc ccggcactcc cccag 355 

<210> 148 

<211> 365 

<212> DNA 

<213> HoT.o sapien 

<400> 143 

aggtcrctcc ccccccctcc ctcccccgcc agccaagtga agacatgctc accccccctt 60 

cacctccctt catgacgcgg gaagagtgct gcaacccagc cctagccaac accgcatgag 120 
agggagcgtg ccgagggcct ctgagaaggc tcctcccaca tctagaaaga agcgcccaag . 180 

angtggcagc ccccctcctc caagcggctc ctgtcccgtt gccctgggag ttctcaaact 240 

gctgcagcag cccccatcca gcctgaggac gacatcaata cacagaggaa $aagagccag 300 

gaaaagacga gagaagctac agactcccct gggcgacccc gagagcttac cattccccag 360 

acttcctca 36 9 

<210> 149 

<211> 62C 

<212> DNA 

<213> Hcrr.D sapien 

<220> 

<22l> misc_feacure 
<222> (1: . . . (620) 
<223> n = A,T,C or G 

<400> 14r 
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gagacatgaa cggactcagc cccacccagg cactcccccc cccactcccc atgccatcca 1440 

ccccccactg cacaccccca cctccgtacc ccacagactg cagcatcgtc agcttcccag 1500 

cgaggctggg ctgttcatca tgtccggact acrtcacgac ccaggggccg accaccatcr 1560 

accagatcga gcactactcc atggatgatc cggcaagtct gaaaacccct gagcaacccc 1620 

gacacgcgat ccggaagggc accctggacc accggcagct ccacgaaccc ccctccccct 1680 

cccatcccct gcggacccca agcagtgccc ccacagtcag tgtgggctcc agtgagaccc 1740 

ggggtgagcg tgccactgac gctgcgcgat ccaccctccg ccagaccatc tctttcccac 1800 

cccgagacga gtggaatgac rtcaaccttg acacggatgc ccgccgcaac aagcaacagc 1860 

gcaccaaaga ggagggggag tgagccccac catgtgagcc ctccctatcc ctcccccaac 1920 

tgccagcccc ctaaaagcac tcctgcctaa cctccaaagc cctctcccta gctcctcccc 1980 

ctccccttgt ccgacttctt aggggaagga gaagtaagag gcctacttcc caccccaacc 2040 

atctgacctg gcacctaatt ctgactctgg ctctaagcct tcaaaactat agcttgcaga 2100 

accgcagctt gccatggcta ggtagaagcg agcaaaaaag agttgggtgt ctcctcaagc 2160 

tgcagagact tcrcactgac tttcataaag zacgctcacc cttacagtcc aagaczacac 2220 

atacaaangt acaaatatac agcacagatt tttgggtggg gggcattgag tactg-rcaa 2280 

aacgcaatcc aaacgaaaga aaatcgagtt gcactnactg accattttcc aacccacrtg 2340 

tcrcggatgg cccgtccata cccccccccc caaggggcat catgtacggc gacaggtacc 2400 

cagagccraa tgczacacgc gagcgacgat gatgtacaga ttctctcagc tctccggati: 2460 

ccaaacacac gccacatcaa accttcgagc agacccactc ccattgct^a ttacgraggn 2520 

aagacr:gcag acacgtaccc ttctcccagt gccggcatac tccacactac cgacac-tct 2530 

tcragcgatg atggttcacg ttggggcgat ttaatccagc tataagaaca agctcacgtc 2640 

caaacgtccc ccccagcttc cggtcgggaa egaggaaaac tctcaaaagg cccacagcag 2700 

ccagtccaaa aacacccgac gccatgcatc cgagcatatc agcaaccccc ttaaactcaa 2760 

taccagacac ctcacrttac aatactgatc gggaaaacac ttgctgccac tacagaggca 2820 

CLaaaactaa anttcactac tagattgact aacccaaata cacacttgcr actgtrgcaa 2880 

_ gaatcctgat tgaitcgatt: gggatgaacg ccanccatcc agctctaaca gcgaacccr.t 2940 

accgtctatt aatattcagg gtaaatagga atcacccaga aacgtcgagc ctgcacraaa 3000 

cagnaagata ccccaatgaa ccataaattc aacticegcaa aaaccctctg aagcaraaac 3060 

aacat-gttc ggtaaacgcc tccctcgccc ggtaaangct ccytctaaag accctrceac 3120 

tctataaaac cccgcatgta gaggctcgtt: zacctctccc tccccaaggc tcacaacagg 3180 

agcggcgatc tgaaaaacat aaaactacga gattggttct cctgtggcac aaattgcacc 324 0 

accgcatcac tcicttcttt aaccggtaag agtttcagtt cgctggaaag caaccgcgag 3300 

aacccagttt cccgtccatc tcccttaggg actacccata gacatgaaag gtccccacag 3360 

agcaagagat aagcccttca tggctgctgt tgctcaaacc actcaaacga agagcrccct 3420 

tgaaactttg ggaaaacatg ttaatgacaa rattccagac ctttcagaaa tataacacac 3480 

tcctttgcat gcacgcaaac gagccctgaa atctccccat gcattctggt caagggccgc 3540 

caccgcacat aagcttccat tttaacccca aagcgcaaaa gggccagcgt ggctccaaaa 3600 

ggcaatgtgt ggaccgcctc tgaaaagtgt gtatatatct tgtgtgaaac tgcatacctc 3660 
gtaccttgat tattcttttc ttctccctgg gatagtggga tttccagaac cacacttgaa . 3720 

acctccttct atcgtttttg tattctcatg aaaataccat ttagtaagaa taccacatca 3780 

aacaagaaat aatgctacaa tttcaagagg ggagggaagg gaaagctcct' tttttcatta 3840 

tt-ttttaaa atrttgtacg ttaaagagaa tgagtccttg atttcaaagt tttgtcgcac 3900 

tcaaatggta ataagcactg taaacttctg caacaagcat gcagctttgc aaacccatta 3960 

aggggaagaa tgaaagctgc tccttggtcc cagtaagaag acaaactgcc tcccccactc 4020 

tgctgagggt tcgaacaaac ccaggacttc cgagctatgt cagtactatr caggcaacac 4080 

cagggccttg gaaacccctg tactgtgtct catggatttg gcaccagcca aagcgaggca 414 0 

cccctcactg gctcacctcc tcatggcagc ctactcccct cgagtgtacg agcagccagg 4200 

gcaaggggca aaaggatagt aagcatagaa accactagaa agtgggctua atggagtrct 4260 

cgcggcctca gcccaatgca gccagctgaa gaattgaaaa gttcttgcct ggagacgttt 4320 

acaaacagaa atggaaagca gagtccccat caaacccttc tacccttccu ttttczrggt 4380 

aacccccraa aacaacagta tgtgggatac cgaacgttaa agggacatcr tctccracta 4440 

ccrcracaat tgcacaaaac taagcaaatg ccaaaagttc tacatgctrc atraaigttc 4500 

ccaaaaggta tcacacacgt gacacattct ttaagcctca gctgcctgrc ttctggtacc 456C 

ctzcgccacg ggccrccggg gagccagaac rcaatccaca acctcrt^cz gtccgccagg 4620 

aca"gcaata aaactcaaaa aacaaacaaa aacta 465S 
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<210> 153 

<211> 2007 

<212> DNA 

<213> Homo sapien 

<400> 153 

gaattcgtcg ctgctccagg gaaagttctg utactccact gactctctcr tttcccgata 60 

acatggccag caagaaagta attacagtgt -itggagcaac aggagctcaa ggtggctccg 120 

tggccagggc aattttggag agcaaaaaat utgcagcgag agcagtgacc agggatgtga 180 

ctcgaccaaa tgccctggag ctccagcgcc ttggagctga ggtggtcaaa ggtgacctga 240 

atgataaagc atcggtggac agtgccttaa aaggtgtcta tggggccttc ttggtgacca 300 
acttctggga ccctctcaac caagataagg aagtgtgtcg ggggaagcrg gtggcagact . 360 

ccgccaagca cctgggtctg aagcacgtgg tgtacagcgg cctggagaac gtcaagcgac 420 

tgacggatgg caagctggag gtgccgcact ttgacagcaa gggcgaggtg*; gaggagtact 480 

tctggtccat tggcatcccc atgaccagtg tccgcgtggc ggcctacttt gaaaactttc 540 

tcgcggcgtg gcggcccgtg aaagcctctg acggagatta ctacacctcg gctgtaccga 600 

tgggagatgt accaatggat ggtatctctg ttgctgatat tggagcagcc gtctctagca 660 

tttttaattc cccagaggaa tttttaggca aggccgtggg gctcagtgca gaagcactaa 720 

caatacagca atatgctgat gttttgtcca aggctttggg gaaagaagtr cgagacgcaa 780 

agattacccc ggaagccttc gagaagctgg gattccctgc agcaaaggaa atagccaata 340 

tgcgtcgttt ctatgaaatg aagccagacc gagatgtcaa tctcacccac caactaaatc 900 

ccaaagtcaa aagcttcagc cagtttacct cagagaacca gggagcctzc aagggcacgt 960 

agaaaatcag ctgttcagat aggcctctgc accacacagc ctctttccrr tctgatcct^ 1020 

ttcctcttta cggcacaaca tccatgttga cagaacatgc tggaacgcaa ttgtttgcaa 1080 

caccgaagga cttcccgcgg tcgcctcttc agtaggaagc actgcattgg tgataggaca 1140 

cggtaatttg attcacattt aacttgctag ccagtgataa gggtggtaca actgtttggt 1200 

aaaatgagaa gcctcggaac ttggagcttc cctcctacca ctaatgggag ggcagattac 1260 

actgggactw cccctgggtg agtaatttca agccctaacg ctgaaatrcc cctaggcagc 1320 
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5.0 








55 








60 




Ala 


Val 


Gly 


Leu 


Ser 


Ala Glu 


Ala Leu 


Thr 


He 


Gin Gin Tyr Ala 


Asp 


65 










70 






75 




80 


Val 


Leu 


Ser 


Lys 


Ala 


Leu Gly 


Lys Glu 


Val 


Arg 


Asp Ala Lys He 


Thr 










85 






90 




95 




Pro 


Glu 


Ala 


Phe 


Glu 


Lys Leu 


Gly Phe 


Pro 


Ala 


Ala Lys Glu He 


Ala 








100 






105 






110 




Asn 


Mec 


Cys 


Arg 


Phe 


Tyr Glu 


Met Lys 


Pro 


Asp 


Arg Asp Val Asn 


Leu 






115 








120 






125 




Thr 


His 


Gin 


Leu 


Asn 


Pro Lys 


Val Lys 


Ser 


Phe 


Ser Gin Phe He 


Ser 




130 








135 








140 




Glu 


Asn 


Gin 


Gly 


Ala 


Phe Lys 


Gly Mec 










14S 










150 













<210> 156 
<211> 128 
<212> PRT 
<213> Homo sapien 



<400> 156 



Mec 


Thr 


Ser 


Val 


Arg 


Val 


Ala 


Ala Tyr Phe Glu 


Asn Phe Leu Ala 


Ala 


1 








5 






10 


15 




Trp 


Arg 


Pro 


Val 


Lys 


Ala 


Ser 


Asp Gly Asp Tyr 


Tyr Thr Leu Ala 


Val 








20 








25 


30 




Pro 


Met 


Gly 


Asp 


Val 


Pro 


Met 


Asp Gly He Ser 


Val Ala Asp He 


Gly 






35 










40 


45 




Ala 


Ala 


Val 


Ser 


Ser 


He 


Phe 


Asn Ser Pro Glu 


Glu Phe LeU Gly 


Lys 




50 










55 




60 




Ala 


Val 


Gly 


Leu 


Ser 


Ala 


Glu 


Ala Leu Thr He 


Gin Gin Tyr Ala 


Asp 


65 










70 




75 




80 


Val 


Leu 


Ser 


Lys 


Ala 


Leu Gly 


Lys Glu Val Arg 


Asp Ala Lys Thr 


He 










85 






90 


95 




Cys 


Ala 


He 


Asp 


Asp 


Gin Lys 


Thr Val Glu Glu 


Gly Phe Met Glu 


Asp 








100 








105 


110 




Val 


Gly 


Leu 


Ser 


Trp 


Ser 


Leu 


Arg Glu His Asp 


His Val Ala Gly 


Ala 



115 120 125 



<210> 157 
<211> 424 
<212> DNA 
<213> Homo sapien 
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< I zvpjjaiwe oM> ^iooosn 



C9£t scEaE-sBoa zz^zdSzidbz; :::j23G:iB:jBe oeoBqaPcaa BziSePBooaS Beea^cccea 

C^5X h5j9=i5uob BE:iEQaaBeB oBcQ^EzoaE :iaHDB:ia-a6 :;aEaBBBaBa acaB::caSSB 

OSei SBBOBZiOBB B6aqaBeB:j3 1b:j6bbdbd:: Hoea^BeBBa :i::uBB4aBQB Bd£:5bdbbbb" 

0061 HcczcoEaao czasaBEaoB £aBB:iBa:ja:i aB:i6BBB3B3 Ba^aBBBai-a oB6:ic:::iBBa 

OvLT tZ4a5aB6B:j 43B6BaBa:j3 aaB^Bfie-^BB 66B:i:iBa3:i6 oeaBfiQ^BBB BBO^aBUzqB 

C€5T 2BB66BBaBfi :i:iB:ia6zDBB :iB3Ba:iUBBB BB:io:ia:iOB:» BBBuBbbbob Ba3ao:iOBB6 

0Z9X 25^V22ZZOD saBBBDBBan ZDBoaBBoBa :iBD6BBfiBBB 0BBq:i3630B BbBbbdbbbb 

C99T £iD:i02B003 66a£aB:;BBB D^BBriBoBBD bdb3bd;:ibo c^oBuiiOBaB BBBaEa::aa:j 

OCST CBBan5e6:iq dbob6b:ibbb qBBqBoq:i:iD ^B:iBBB:jBa5 afiBBBBB^co BB^B^oi^aDO 

O^^X t26BBBBaD0 bb3db:ibboB BbbBboo:io:i :i3:jB:idz::3b BeBBBBaBoa BoB:iDBBa6:j 

OeZX BBBB3BBBB0 ^a^BDBBB^D BBBODO^uBB BDBO^BBOBa BBBBOtlBBaB BqBBBOUaae 

OZtX £:i3BBSqBB6 :jBBB6aeBBB oBbbiBbBbB BBBBijDooBa B03BB0DCDB oaeeBBBBBD 

0921 EdbBbdbdBb BBBBBBoaBB BBBB:i:ia3B:i 6:iBbbbBjo6 4o6:iD2B:ioB ^B:i:iBbBbdB 

OOZX cafiOBBq:iDB :iDBqBoBa33 BB^BiionBoB ao33aBfi:iB3 a:iaaBOOB:ia ^BuBoobBbb 

OfriT £BBBDBB0DB,BB:iDa:iaB3:i :joB3obB:i:iB a:io3BBoooD BaoooDBB:ia BBBBaBaBBB 

0801 ^ £aaB:jBBB:iD 3dB:i30B0Db :»BB63bbdb:i :i:iODaaDB3:i :i3B:j6aBaBq b:i:j3ddBbdB 

020T ' HBBa:jao:iBO :»BBBDOaa:i-4 ao:3BoBBoBB DOBOBaBoBB 6:iBB3BaD6o 3BB6Bb6b3B 

096 65o::a5BBB6 BoBBBBBBaq bobBBbBbiib DBBBBBoq:;^ bB^bbBBbBb bBbBbBBbBo 

006 £SBDa:iBBBD Bo^dbdobbb BoboBBbob^ OBaooBBaBq BoBaBOoBBa BB:ioBao£oB 

0 5-8 HDDCDBDoaD aBoaaoqEBzj dBbooudooE BBziBaoBBoo aBOBODcafii 66oooa:ia:i:i 

Oei. CBODBfiiiBao 6:iBaB3BBBB BoBaBBBBoD BBDfiaoBOBD bboBuoqEoh ::obB6bobbd 

QZL £b£db£3665 £:j£aBBBB6B DOoBtioBB^B BiiBoqocaBB :iBo:iBB6oc-a a:iaB:i:i2cqB 

099 cz5bdbd3bB B^BuBBbbBB Bo^bdBbdob 03:idbobbbd BBoafitiBBBB BouBOZtiaoB 

009 CBofiaDDOBfi B6£B:i:ibBB3 bboBoBbcob BB^BfiBBBoti obBBbcbbbB BaaBoDDBoB 

G^S c-.-5BDBBBB:i caBBBaaao:! :iB3:i03o6:jq oubBBBbBbo BoasocBSBB :io:i66b6Bbb 

081- cBSBBafiBBB aBBooaoBDB BBaoa:iD6B:i aBaoBoEafizj oBooBoaBaB oanBOBczaB 

027 rsBBODBDOD iBSzDVDztzzzk bdbBoBBbdd liaBiBBBiza BBoa^BBiria DiiBoEaBoa:; 

09£: zzoooBDBzb 2z:>ievzzzzi^ bbbbBbbbBd Boz^iezBzzzz HBeoeBB:iDE BijoaBB^BBB 

00£ 2Ha6BBbBBV bEBbeBBBbo zBBBBBBoeo BBBEBEzcoa qDcoq30B£:i cBSBBdbbbd 

OvZ E-zzacaE^o o:j£3E5£DBn a£aBB:jjq3z BzoobBebBb EEaBBczBric c:ia3Eo6BBE 

Of! £e3de3E6e:^ diiEdbc'bezz BboBbEboEo oBBodBooEB BbooEoSoco EbBbdcdBoo 

QZZ z^czzod6'6o BBEDBBBcao oBOODBaooE zcobBosdBe BoodBoBEdB BcoEedbEod 

09 rEBDEEoBEo odbbbBccob oziZZoooBbD dBbEBEzBBb cBboBoBEbb BBaaEBoEoD 

8ST <00fr> 

usxdes cujoH <£XZ> 
VNG <ZXZ> 
6602 <TI2> 
851 <0T2> 

aoBq 



^Z^ ' 








OZv 


-EoaBZDoaa 


qBaoBzDDBB 


OBDEaBB:iEB 


09e 


zzoazoB^oB £aBBBBqD:iB 


BaDqEOD:j63 


ooe 


ebbBoooo^:! 


oaBBBBaoaB 


bB^iieeBbob 


05^2 


EEbBbE^BBb 


BBbBeozoBh 


BaDBEBBfi^Z 


OSI 


bEdBzoBBbb 


BBBBaBE303 


ccdbbBBedo 


02T 


ED3:iCCaD3B 


EC3qoaoa:j:) 


beebeb:!^^:: 


09 


zzoeEbbob^ 


:jZ3aBBaaEo 


:iaBEBEa66:j 



:aB: 



AST <00t> 

0 O'x'V = u <e22> 
(l'2^) "J (I) <222> 
sjnqEag dstui <T22> 
<022> 



PL9LP/66 OM 



wo 99/47674 



PCTAJS99/05798 



76 



cggaacagtg tggaagcaga aggccttttt aactcatccg tccgccaacc actgcaaaca 2040 
actgaaatgt ggatgtgatt gcctcaataa agctcgcccc cattgctcaa aaaaaaaaa 2099 

<210> 159 
<211> 291 
<212> PRT 
<213> Homo sapien 



<400> 159 



Met 


Asp Trp Gly Thr Leu His 


Thr 


Phe 


ile 


Gly 


Glv 


Val Asn 


Lys 


His 


1 




5 






10 








15 




Ser 


Thr Ser 


He Gly Lys Val 


Trp 


He 


Thr 


Val 


lie 


Phe He 


Phe 


Ara 






20 




25 








30 






Val 


Met lie 


Leu Val Val Ala 


Ala 


Gin 


Glu 


Val 




Gly Asp Glu 


Gin 




35 




40 










45 






Glu 


Asp Phe 


Val Cys Asn Thr 


Leu 


31n 


Pro 


Glv 




Lys Asn Val 


Cys 




50 


55 










60 








Tyr 


Asp His 


Phe Phe Pro Val 


Ser 


His 


lie 


Ara 


Leu 


Trp Ala 


Leu 


Gin 


65 




70 








75 








80 


Leu 


He Phe 


Val Ser Thr Pro 


Ala 


Leu 


Leu 


Val 


Ala 


Met His 


Val 


Ala 






85 






90 








95 




Ty-r 


Tyr Arg 


His Glu Thr Thr 


Arq 


—VS 

— jr 


Phe 


Ara 


Ara 

ru.y 


Gly Glu 


Lys 


Ara 






100 




105 








110 






Asn 


Asp Phe 


Lys Asp He Glu 


Asp 


- le 


Lvs 


Lvs 


Gin 


Lys Val 


Arg 


He 




115 




120 










125 






Glu 


Gly Ser 


Leu Trp Trp Thr 


Tyr 


Thr 


Ser 


Ser 


lie 


Phe Phe 


Arg 


lie 




130 


135 










140 








He 


Phe Glu 


Ala Ala Phe Met 


Tyr 


Val 


Phe 


Tyr 


Phe 


Leu Tyr Asn 


Gly 


145 




150 








155 








160 


Tyr 


His Leu 


Pro Trp Val Leu 


Lys 


Cys 


Gly 


lie 


Asp 


Pro Cys 


Pro 


Asn 






165 






170 








175 




Leu 


Val Asp Cys Phe He Ser 


Arg 


Pro 


Thr 


Glu 


Lys 


Thr Val 


Phe 


Thr 






180 




135 








190 






He 


Phe Met 


He Ser Ala Ser 


Val 


Zle 


Cys 


Met 


Leu 


Leu Asn 


Val 


Ala 




195 




200 










205 






Glu 


Leu Cys 


Tyr Leu Leu Leu 


Lys 


Val 


Cys 


Phe 


Arg 


Arg Ser 


Lys 


Arg 




210 


215 










220 








Ala 


Gin Thr Gin Lys Asn His 


Pro 


Asn 


His 


Ala 


Leu 


Lys Glu 


Ser 


Lys 


225 




230 








235 








240 


Gin 


Asn Glu 


Met Asn Glu Leu 


He 


Ser 


Asp 


Ser 


Gly 


Gin Asn 


Ala 


He 






245 






250 








255 




Thr 


Gly Ser Gin Ala Lys His 


Phe 


Lys 


Val 


Lys 


Cys 


Ser Cys Val 


He 






260 




265 








270 






Arg 


Arg Leu Leu Ser Ser Pro 


Glu 


Gly 


Asn 


Thr 


Asn 


Leu Lys 


Val 


Pro 




275 




280 










285 







Ser Val Ala 



290 

<210> 160 
<211> 3951 
<212> DNA 
<213> Homo sapien 

<400> 160 

tctgcaccca tactgaaaac ctgacacaa^ gtacgcagca ggctcagcgc gagtgaactg 60 
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ZStZ B2EC2a:iEae zzcz^zzeS^ ecaoeBoeaa a£aao6z36E eoee6eDc-6 aoae^acc-o 
ooee pe-nnSzaao ^220^222^^ ^v^oov^B^S ^B^zziBzo^^ ije-rjeDB-c:- 

0f^2e 55h3c56b66 eB^eoBBeoB SHOODoBeae eBe966a56e 6zaH:i:j:iaEq aa6BBe66BE 

081 e c-5a6EDDB6 63a5GBBa66 5ee566eBE- 56BBBa:iaDtj E:»BBq6eBH3 a:iD:iaeaaBE 

OSie BBBZBBaBBB obb6:ib3oB5 Z4Zdbdb:i:;3 oobbbbBbq^ BDBtiBfiBBBa -:i:ia04aBBB 

090e OBBOZlBfiBafi 5Z^D^22222 BfiBBaBBBOE aBBDBUBa:^! 22B^5Z2^DB aBBBBCiaBlB 

OOOe :;3BBBBOaB3 BBaaeSBOaB BBBDBBUOB:i BOBBBBBOB:! OBfioaaCOBB UBOOOBfiBBq 

01^62 B3B6e33oa:i 03a:;csBa6B bbcdubiibbb :iEBBnB3:iBa ubbbbdbbBB qBBfiBBBBBB 

0862 bbobdboBbB bBbbbbbBBb ooBBqqaoBZ BoaB3B3e6:j £a:iBBQBuaB QacoBaa:iBD 

0382 3BB6BB3e63 :5a665aBE06 BDBB^tianBB 66BBB6:jqB:i Ba:iDaB:^3BB bdbddBqoob 

09Z.3 nBacoaBBqo i^bbocoood :i:iB tin :ifi ^o:j odBoBBbooo 63:ibobb:io3 B:j6aDB3oaB 

OOLZ BDE^aOOaOB bBBb^bBBOE BOBeBDEaBE D6uaBaB333 BBBbDEOOBe BEBaBDBDBB 

0V9Z BfiBoBaBBBO OBbOUEOBBB aDDBBB^BEB OBDCaU:iBBE oooDEoao^a BDE^^qeaBfi 

0852 BB6BbD3EOB B^OBBBOBBD ZODIIEBbBdB EEEOaBOBaB BB36B3a3aB aoBDBBOBB^ 

0252 3:i3BBaB6BB PD^BaBBBEO EqcaBEBBBa BeB3EEBbB3 BBBB:iBa36B B0B30BBB0D 

09^2 BBBEDaBBaq aoBfiBEBBBB 330ed6bdbB B^ao^Etiooo Bfia^BEBBEB BEBeaBBBEE 
001^2 ' 36aDBBBBBa OOBBa^BaiE BBEOB^EDDE DOUai^BqB^E B^ODDBDOOO DBB^oBEDOtJ 

0l^e2 qBBBBBaoBa BEo^aa^ooa dBBbBBeduo BBoaSBfiooB Ea^aoBBBBa BebbBoBeBB 

0822 bBBeB^bebB EDBBEaBEoq beeBBbedoq oBriBEBaBBB 3:i3E3BEiBB debbdBedbd 

0222 E^riBBBOOEq B^EaBaBaofi oeouBeBBBe ddz21£2D2D^ codBbooodb o6bb::eqBbo 

09T2 cc:;oaoB03E EoaBjBoBaB EBBfiaaoBB:; b^bBe^BB^b EBoBaoBa^q 0030^33333 

0012 E3dBbB030B 333EB6B3EB 3EBEBEE3B3 3B3BB30B3B BboBbBB3BB 3E6330030E 

0^02 BBfi30B0E33 63003eBeBB -OEBeBeOOB EB326E3EOO B30E03fi30E O0B3BB33O3 

086T 3E0OO3B333 3BBfiBEOBBE B3B3EBOOB3 E33ZBB3EB3 B30O3E0300 3333BOD300 

0261 BeoeBeBbbe BB3B3Z300B BEB63630B0 oBEoooooBq 63o6EOaOBE D030060B03 

0981 030OB63BEO BB3BEE6B3O OOBEBOB303 O33BO3E0CO E3BEOBE630 OOBOE330BB 

0O8T E3oeoBBB3o oBeb306boe bBBboo33w5 633Z036b30 5bobSBd333 :;obb30-4Eeo 

Otil OB03B3333B E3BEEOBOE3 OE3EBEB6oe DB3eBZ0C3B B333B33B3B 33BBEfc3003 

G89T OO3BB36BOO 6BEO££36oE 33BF303Z35 3B--;0BObB0E B06BB3fi33E 3BE3EB/33B3 

0291 0B635BOEOE BEEE£33ceO qE0DB330CP EE03B3EEEE B3d6bOBuBb BEB333Be04 

0951 aE3B3EE0BB 333333EOEB e6B33BB653 3330333EeB E3BB333336 3BB33E63B3 

OOST BB3EB3333E BE33E3B3ED E0043B3333 33BBEB-433B Bb6BB06333 3£oB33E-^3B 

Ol^M EB6B6B303E EB33C3BBOB 333B033£Bd :;030£33E00 33E333EB3B B333B636E3 

Oeei 6E333D3B33 B303B333Bq 3BB3BB-30- 33BbE3EB3E BeoBoBbOOE 63BE33B3E6 

02eT 3E63B30336 B3B333BEEB eBB3EBB33E B6BEB33BB3 B3EB333B6fi EBBBeE333D 

C92T 66E333B333 B3BE33B3BB E0BEBE30Bb 03E3533B30 E333B333B3 B3333BB306 

CC2T -35eEE603E B3EB3EB06e 3EB33EBB30 E3E336E033 6eBE03BBeB b6BeEE36B3 

Ot-IT b£3333BE33 B33B3DBB3B 33330E3E33 3BBE633B33 bBe3B3EB33 3B3333EEBb 

C80T OCOOBeBOBE 3E33EB3333 330BBB3E53 3BBeBe3BB3 b6eB3BE033 63B3BBB306 

0201 3BB333B333 B63BBEB3EB 3BB33B6BBE 3B3Z3B0333 3B3E333330 333E303336 

096 • BB30EBBB3E EB3E03Oq33 3BB0B03B33 333BB33B33 30eBE0B03E B3B3E6BB3E 

006 3C3BbeBB03 03BeoB363E BE30BeBB3B 303BEE30B3 Beb6eB03EE OE333E3BbB 

0^8 3B3BE3B333 3EB633BB3B 333333B333 BBEB3B3E33 363EE3BE33 B3fi33EB3B3 

08^ EEBB300E36 EqEB3E333B 33300E3B3B BBebBbeB33 3333BbE3BE 33E33B3B33 

02Z. BBEBBeEOOO 03B33333fiB EBEBEB3B3B 3B33333B36 BB3B33E3B6 333E333B3B 

099 6eE3eB36BE B33EEB33EB BB0BBB3EBE 3EOE303330 OBEB3B63BB 3EE3B3BeB3 

009 B6333B3B3B B6B3363333 3E303BB63E BB3E033B33 3636bB30BB B6oE03EBBo 

O^S E336B33BE3 BBa33EE3BB 3EB633E330 333EB300BO B3333E333E 3E3BEEBBBB 

08fr b5bBEEBB3B 3BBBb6B3B3 EB3E3303E0 EaE333BB3E fiEB33B3E3B B663E3BB33 

021^ BE33eB3BB3 B3363BBE36 BbeEBB3B3B 33BBBbE3BE EB3EBBB3Be 3BE3EB3EB3 

09e 3BEEBBB3B3 B33B330B3E B3333BBeBB 3E3EEBBB33 33a3B3BBBB b6eBEB30B3 

OOe 3B3BE333B3 30E3343E33 asBEBBzOEE 3EB3EEEBBb B33B3EEB33 33B0333BBB 

0^2 E33EbBbB33 3BaBBB0333 3EE33EB3B3 3B3-3B33EB B3EE3E36BB 3BB3BBbB33 

08T 30BEOE3BeB B30BBBBB33 033B33033B eBe033BbB3 300£33B533 03333eB363 

OcT 33£EBfi333E E36333B333 3BBE3B::3E0 BbBBbbEOOO BB3B3Ey3B3 333333BBeB 



LL 



86tS0/66Sn/lDd 



wo 99/47674 



PCTAJS99/05798 



78 



tacgaagccc ctaacgcaaa gctcttcacc tcctgctatt ttgccatata tattacagat 3420 

gaaacctcac tgccaacgct cagagatctt ctttcactgt aagaggtaac ctttaacaat 3480 

atgggtacta ccurcgcccc ttcacaccgg ttctacgaca aaggcctact gaattcatct 3540 

gtccgtaagc ttccactccc atcaaagcag cctttcaagt tatcgcctcg gttatcacgg 3600 

atgacagtta tagcccttat aatgccctaa ccaaggaaga aaagacgtta ctccgagctt 3660 

gttctaatac atacatgaac atatagcttt attcaattaa accaaagaag aggtcagcag 3720 

ggagatacta acctctggaa atgattagct ggctcrgttt tttggtcaaa taagagtctt 3780 

caatcctctc tccatcaaga gctactcacc aagggcaggg gaagggggat atagaggtcc 3840 

caaggaaaca aaaaccacct ttcatcctta attctactcc tccctcttat cctcctaaaa 3900 

gattatcgaa caacaaaatc atttgccttt ctaattaaaa acataaaaaa a 3951 

<210> 161 

<211> 943 

<212> PRT 

<213> Hcmo sapien 

<400> 161 

Met Thr Gin Arg Ser lis Ala Gly Pro lie Cys Asn Leu Lys Phe Vai 

^ 5 10 * 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 * 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn lie Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Vai 
5^ 70 '75 ' 80 " 

Phe Phe Arg Asn He Lys He Leu He Pro Ala Thr Trp Lys Ala Asn 

85 90 " 95 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 105 110 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 " 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr He His Phe Thr Pro Asn 

130 135 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly Ser Arg Gly Arg 
145 150 155 160 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 

165 170 175 

Tyr Asn Asn Asp Lys Pro Phe Tyr He Asn Gly Gin Asn Gin He Lys 

180 185 190': 

Val Thr Arg Cys Ser Ser Asp He Thr Gly He Phe Val Cys Glu Lvs 

195 200 205 

Gly Pro Cys Pro Gin Glu Asn Cys He He Ser Lys Leu Phe Lys Glu 

210 215 220 

Gly Cys Thr Phe He Tyr Asn Ser Thr Gin Asn Ala Thr Ala Ser He 
225 230 235 240 

Met Phs.Met Gin Ser Leu Ser Ser Val Val Glu Phe Cys Asn Ala Ser 

245 250 255 

Thr His Asn Gin Glu Ala Pro Asn Leu Gin Asn Gin Met Cys Ser Leu 

260 265 270 

Arg Ser Ala Trp Asp Val He Thr Asp Ser Ala Aso Phe His His Ser 

275 280 * 285 

Phe Pro Met Asn Gly Thr Glu Leu Pro Pro Pro Pro Thr Phe Ser Leu 
250 295 300 
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I^A -SS eqc -ss ^10 
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. OOZ. 

311 STH HIV OJCd 
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OLS 
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SS9 
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039 
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S09 

HTO T^A Jqi exv oad 
065 

:cqx T^A SAT naq eiv 

jqx dax STH Ato oad 
095 

6tv sqd Jqx naq usv 
OfrS 

dsv ojd asv Slid naq 
535 

XBA nsq aqd :isw ^MX 
015 

usv sXq nsT uxo sth 

UTO exi STH UXG uxo 
08^ 

©Md HXV Qsv an aaw 
09^ 

sXq nsT AxO Axe aqi 
5p^ 
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ttctactcct tccrcttact cccttaaaag actatcgaac aacaaaacca tttgcctttt 3360 



<210> 168 
<211> 2784 
<212> DNA 
<213> Homo sapien 

<400> 168 

tctgcatcca cattgaaaac ccgacacaac gcacgcagca ggctcagtgt gagtgaactg 60 

gaggcttctc tacaacatga cccaaaggag cattgcaggt cccatttgca acctgaagtt 120 

tgcgactctc ccggctgcct taagttcaga actcccatcc ctgggagctg gagtacagct 180 

ccaagacaat gggcataatg gattgcccat cgcaaccaat cctcaggtac ctgagaatca 240 

gaaccccacc tcaaacatca aggaaatgat aactgaagct tcactttacc cattcaatgc 300 

taccaagaga agagtattct tcagaaatac aaagatctta acacctgcca catggaaagc - 360 

taataataac agcaaaacaa aacaagaacc atatgaaaag gcaaatgcca tagtgactga 420 

ctggcatggg gcacatggag atgatccata caccccacaa tacagagggc gtggaaaaga 480 

gggaaaatac atccacttca cacccaactc cctaccgaac gataacccaa cagccggcta 540 

cggaccacga ggccgagtgt ttgtccatga acgggcccac ctccgctggg gtgcgcccga oOO 

cgagcacaac aacgacaaac ctctccacac aaacgggcaa aaccaaatca aagtgacaag 660 

gtgtccatcc gacatcacag gcactctcgt gcgcgaaaaa ggcccttgcc cccaagaaaa 720 

ccgtattact agtaagctcc ctaaagaagg acgcaccttt atctacaata gcacccaaaa 780 

tgcaaccgca tcaataatgt tcacgcaaag -ccatcttct gtggttgaat tttgcaacgc 840 

aagcacccac aaccaagaag caccaaacct acagaaccag acgtgcagcc ccagaagcgc 900 

atgggatgca atcacagact ccgccgacrt tcaccacagc ctccccatga acgggactga 960 

gcctccscct ccccccacac ccccgczcgt agaggrtggt gacaaagtgg ccrgctcagt 1020 

gcrggatgtg tccagcaaga cggcagaggc rgacagactc cttcaaccac aacaagccgc 1080 

agaactttat tcgacgcaga ccgtcgaaat icatacctcc gcgggcaccg ccagt-ircga 1140 

cagcaaagga gagatcagag cccagctaca ccaaactaac agcaatgacg atcgaaagr:- 1200 

gccggtttca tatctgccca ccactgtacc . agctaaaaca gacatcagca tttgttcagg 1260 

gcntaagaaa ggatttgagg cggtcgaaaa accgaatgga aaagctcacg gctccgtgac 1320 

gacactagtg accagcggag atgataagct cctcggcaac tgcctaccca ctgcgcccag 1390 

cagcggctca acaattcact: ccattgccct gggttcatct gcagccccaa atctggagga 1440 

atcatcacgc cccacaggag gtttaaagct ccctgcccca gatacaccaa accccaacag 1500 

catgattgat gctttcagta gaatttcctc rggaaccgga gacactctcc agcaacatac 1560 

tcagcttgaa agtacaggtg aaaacgtcaa acctcaccat caactgaaaa acacagcgac 1620 

tgtggataat actgtgggca acgacaccat gtttctagtt acgtggcagg ccagtggtcc 1680 

tcctgagact atactattcg accctgacgg acgaaaatac cacacaaaca attttatcac 1740 
caacctaacc tttcggacag ctagtccttg gattccagga acagctaagc ccgggcactg . 1800 

gacttacacc ctgaacaata cccatcattc tccgcaagcc ctgaaagcga cagtgacctc 1860 

tcgcgcctcc aacccagccg tgcccccagc cactgtggaa gcctttgtgg'aaagagacag 1920 

cccccattcc cctcatcctg tgatgattta tgccaacgcg aaacagggac tttatcccat 1980 

tcttaatgcc actgtcaccg ccacagttga gccagagacc ggagatcctg ttacgctgag 2040 

ac::cctcgat gatggagcag gtgccgatgt cataaaaaat gatggaattt actcgaggta 2100 

ttctttctcc tctgctgcaa atggtagaca cagcttgaaa gtgcatgcca accaccctcc 2160 

cagcataagc accccagccc actctacccc agggagtcat gctatgtatg taccaggtca 2220 

cacagcaaac ggcaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 2280 

ggagcgaaag tggggcttca gccgagtcag ctcaggaggc ccccticcag tgctgggagc 2340 

tccagccggc ccccaccctg atgtgctccc accatgcaaa accaccgacc tggaagctgc 2400 

aaacagaaga ggaattgacc ctatcccgga cagcacccgg agaagacct- gatcagggcr 2460 

aggctacaag ctatgaaaca agaatgagta aaagtccaca gaacacccaa gacgacctta 2520 

acaatgctac tttagtaaar acatcaaagc gaaaccctca gcaagctggc atcagggaga 2580 

tatttacgtt ctcaccccaa atttccacga acggacccga acaccagcca aatggagaaa 2640 

cacacgaaag ccacagaact catgctgcaa racgagcaac ggacaggaac cccccacagi 2700 

ccgctgtatc taacaccgcc caggcgcccc cgtccacccc crccaacccr gaccctgca- 2760 
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370 375 380 

Ser Ala Lys Thr Asp He Ser He Cys Ser Gly Leu Lys hys Gly Phe 
385 390 395 400 

Glu Vai Val Glu Lys Leu Asn Gly Lys Ala Tyr Gly Ser Val Met He 

405 410 415 

Leu Val Thr Ser Gly Asp Asp Lys Leu Leu Gly Asn Cys Leu Pro Thr 

420 425 430 

Val Leu Ser Ser Gly Ser Thr He His Ser He Ala Leu Gly Ser Ser 

435 440 445 

Ala Ala Pro Asn Leu Glu Glu Leu Ser Arg Leu Thr Gly Gly Leu Lys 

450 455 460 

Phe Phe Val Pro Asp He Ser Asn Ser Asn Ser Met He Asp Ala Phe 
465 470 475 480 

Ser Arg He Ser Ser Gly Thr Gly Asp He Phe Gin Gin His He Gin 

485 490 455 

Leu Glu Ser Thr Gly Glu Asn Val Lys Pro His His Gin Leu Lys Asn 

500 505 510 

Thr Vai Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe Leu Val 

515 520 ' 525 

Thr Trp Gin Ala Ser Gly Pro Pro Glu He He Leu Phe Aso Pro Aso 

530 535 540 

Gly Arg Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn Leu Thr Phe Arg 
545 550 555 sSO 

Thr Ala Ser Leu Trp He Pro Gly Thr Ala Lys Fro Gly His Trp Thr 

565 570 575 

Tyr Thr Leu Me- Cys Phe His His Ala Lys Leu Leu Thr Trp Lys Leu 
580 535 5S0 

<210> 170 
<2H> 791 
<212> PRT 
<213> Homo sapien 

<400> 170 

Met Thr Gin Arg Ser He Ala Gly Pro He Cys Asn Leu Lys Phe Val 

15 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys' Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
65 70 75 80 

Phe Phe Arg Asn He Lys He Leu He Pro Ala Thr Trp Lys Ala Asn 

85 90 95 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 105 110 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr He His Phe Thr Pro Asn 

130 135 ' 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly Ser Arg Gly Arg 
145 150 155 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 
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Val 
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Arg Asp Ser Leu His Phe Pro 


His Pro Val Met 


He 




610 








615 


620 




Tyr 


Ala 


Asn 


Val 


Lys Gin Gly Phe Tyr Pro He 


Leu Asn Ala Thr 


Val 


625 










630 635 




640 


Thr 


Ala 


Thr 


Val 


Glu 


Pro Glu Thr Gly Asp Pro 


Val Thr Leu Arg 


Leu 










645 


650 


655 




Leu 


Asp 


Asp Gly Ala Gly Ala Asp Val He Lys 


Asn Asp Gly He 


Tyr 








660 




665 


670 




Ser 


Arg 


Tyr 


Phe 


Phe 


Ser Phe Ala Ala Asn Gly 


Arg Tyr Ser Leu 


Lys 






675 






680 


685 
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Val 
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Ser Pro Ser He Ser Thr 


Pro Ala His Ser 


He 




690 








695 


700 
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Met Tyr Val Pro Gly Tyr 


Thr Ala Asn Gly 


Asn 


705 










710 715 




720 


He 


Gin 


Met 


Asn 


Ala 


Pro Arg Lys Ser Val Gly 


Arg Asn Glu Glu 


Glu 










725 


730 


735 




Arg 


Lys 


Trp Gly 


Phe 


Ser Arg Val Ser Ser Gly 


Gly Ser Phe Ser 


Val 








740 




74 5 


750 




Leu 


Gly 


Val 


Pro 


Ala 


Gly Pro His Pro Asp Val 


Phe Pro Pro Cys 


Lys 






755 
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He 


He 
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Leu 


Glu 


Ala Val Asn Arg Arg Gly 


He Asp Pro He 


Leu 




770 








775 


780 
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Ser 


Thr 
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Arg Arg Leu 







785 790 



<210> 171 

<211> 1491 

<212> DNA 

<213> Homo sapien 

<400> 171 

cctcccgcca gccaagtgaa gacatgctta rttccccttc accttccttc atgatgtggg 60 

aagagtgctg caacccagcc ctagccaacg ccgcatgaga gggagtgtgc cgagggcccc 120 

cgagaaggct tctcccacat ctagaaagaa gcgcttaaga tgtggcagcc cccctccctc 180 

aagtggctct tgtcccgttg ccctgggagc cctcaaattg ctgcagcagc ctccacccag 240 

cctgaggacg acaccaatac acagaggaag aagagtcagg aaaagatgag agaagctaca 300 

gactctcctg ggcgaccccg agagcttacc attcctcaga ctccttcaca tggcgctaac 360 

agatttgttc ctaaaagtaa agctctagag accgtcaaat tggcaataga agccgggttc 420 

caccatattg attctgcaca tgtttacaat aatgaggagc aggttggact ggccacccga 480 

agcaagattg cagatggcag tgtgaagaga gaagacatat tctacacttc aaagctttgg 540 

agcaattccc atcgaccaga gttggtccga ccagccttgg aaaggtcact gaaaaatctt 600 

caattggact atgctgacct ctatcttatt cattttccag tgtctgtaaa gccaggtgag 660 

gaagtgatcc caaaagatga aaatggaaaa atactatttg acacagtgga tctctgtgcc 720 

acatgggagg ccacggagaa gtgtaaagat gcaggattgg ccaagtccat cggggtgtcc 780 

aacttcaacc acaggctgcc ggagatgatc ctcaacaagc cagggctcaa gtacaagcct 840 

gtctgcaacc aggtggaatg tcacccttac ttcaaccaga gaaaactgct ggatttccgc 900 

aagtcaaaag acactgctcc ggttgcctac agtgctctgg gatcccatcg agaagaacca 960 

tgggtggacc cgaacccccc ggtgctcttg gaggacccag tcctttgtgc cttggcaaaa 1020 

aagcacaagc gaaccccagc cctgattgcc crcgcgccacc agctgcagcg tggggttgtg 1C80 

gtcctggcca agagctacaa tgagcagcgc atcagacaga acgcgcaggi: gcttgaattc 1140 

cagctgactt cagaggagat gaaagccata gatggcctaa acagaaacgt gcgatactcg 1200 

accctcgata tttttgccgg cccccctaac ratccatttt ctgatgaata ttaacacgca 1260 

gggcactgca tgaggcctgc cagaaggccc rgcgcgtgga tggtgacaca gaggacggcc 1320 

ctatgcrggt gaccggacac atcgcctctg gttaaatctc ccctgctcgg cgactccagt 138C 

aagctacagc taagcccatc ggccggaaaa gaaagacaat aatcttgcct tccatctcga 1440 
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sth aas :tas ^yx uio oad an ayx naq nio Bav cad Bov Aio bad ^as 

S^ 0^ SZ 

dsv Jqx IBA nio Bav :iaw sAq nio uio aas sAq sAq 6av uio JUX usv 

Oe S2 02 

an dsv dsv nxo oad uxo aqx :ias biv biv biv biv ail uio aas aas 

ST OX SI 

-^10 oad sAo sAo aas naq naq dax sAq aqd aqd naq oad uio dax 3aw 

2Z.X <00^> 

uaTdBS ouiOH <EX2> 
XHd <2I2> 
^SZ <II2> 
2Z.X <0X2> 

X6^X B BBBBBBBBBB BBBB^DOBO:! nO^UBBBBB^ OD:i3aaao6q BBB^^BBBBB 
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